Gemini API & Google AI
Hands-on course on Google Gemini API: from first request to multimodal apps with vision, audio, and video. Gemini 2.0 Flash, Gemini 1.5 Pro, Live API, function calling, Search grounding, code execution, context caching, and Vertex AI enterprise deployment. Real code in Python and JavaScript.
Getting Started with Gemini API
API key, SDK install, your first generateContent call, model selection, and core generation parameters.
Multimodal Capabilities
Vision, audio, and video as input: inline base64, File API, image reasoning, transcription, and video understanding.
Advanced Generation Features
Function calling, Google Search grounding, and built-in code execution — extending the model with external capabilities.
Production Patterns
Streaming, context caching, and the Live API — what real apps under load need.
Enterprise & Ecosystem
Vertex AI for enterprise and the broader Google AI ecosystem: SDKs, Gemma, embeddings.