OpenAI Audio & Vision: Whisper and GPT-4V · Lesson 5
GPT-4V: Image Tagging and Captioning
Use GPT-4o-mini vision to generate tags and descriptions for e-commerce products. Deduplicate tags via embedding cosine similarity and turn descriptions into concise captions using few-shot prompting.