by Google· Released May 2024· Cutoff Early 2024
Gemini 1.5 Flash is a lightweight, fast, and cost-efficient multimodal model designed for high-volume, latency-sensitive applications. It is optimized for tasks like summarization, chat, and image/video captioning, offering a balance of performance and speed. As part of the Gemini 1.5 family, it supports a 1 million token context window and native multimodal inputs.
Input cost
$0.075 per 1M tokens
Output cost
$0.30 per 1M tokens
Context window
1M tokens
Max output
8192 tokens
Modalities
License
proprietary
High-volume, latency-sensitive applications requiring fast and cost-effective multimodal processing.