OpenAI Fine-tuning: From Data to DPO · Lesson 4
Model Distillation: gpt-4o → gpt-4o-mini
Distillation uses store=True in Chat Completions to capture large-model outputs, then fine-tunes a smaller model on that data — reducing cost and latency without sacrificing accuracy.