by Google
Gemini 3 Flash is a lightweight, fast, and cost-efficient model from Google, optimized for high-throughput tasks and real-time applications. It balances performance and speed, making it ideal for simple reasoning, summarization, and chat use cases. As part of the Gemini 3 family, it offers lower latency and reduced pricing compared to larger models.
Input cost
$0.10 per 1M tokens
Output cost
$0.40 per 1M tokens
Context window
1M tokens
Max output
8192 tokens
Modalities
License
proprietary
High-speed, low-cost tasks such as real-time chat, summarization, and simple code generation.