by Google · Released May 2025 · Cutoff January 2025
Gemini 2.5 Flash is a cost-efficient, low-latency multimodal model designed for high-volume production workloads. It balances speed and quality, offering strong reasoning at a lower cost than Gemini 2.5 Pro. It accepts text, image, audio, and video inputs within a 1M-token context window.
Input cost
$0.15 per 1M tokens
Output cost
$0.60 per 1M tokens
Context window
1M tokens
Max output
8192 tokens
Modalities
text, image, audio, video (input)
License
proprietary
High-volume, cost-sensitive applications requiring fast responses and multimodal input processing.
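For budgeting, the listed per-token prices translate into a per-request cost with simple arithmetic. The following is a minimal sketch, not an official calculator: the function name `estimate_cost` is hypothetical, the constants are copied from the figures listed here, and it assumes "1M" means exactly 1,000,000 tokens and that only the input is checked against the context window.

```python
# Figures copied from this listing (USD per 1M tokens); verify against
# current provider pricing before relying on them.
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60
CONTEXT_WINDOW = 1_000_000  # assumption: "1M tokens" read as exactly 1,000,000
MAX_OUTPUT = 8192


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens > CONTEXT_WINDOW:
        raise ValueError("input exceeds the listed context window")
    if output_tokens > MAX_OUTPUT:
        raise ValueError("output exceeds the listed max output")
    # Each side is billed at its own rate per million tokens.
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


if __name__ == "__main__":
    per_request = estimate_cost(input_tokens=2_000, output_tokens=500)
    # 10,000 requests of this shape
    print(f"${per_request * 10_000:.2f}")  # prints $6.00
```

Because output tokens cost four times as much as input tokens at these rates, a 500-token reply costs the same as a 2,000-token prompt, so capping response length matters as much as trimming prompts for high-volume workloads.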