by Google· Released February 2025· Cutoff January 2025
Gemini 2.0 Flash Lite is a cost-efficient, low-latency model optimized for high-volume, text-only tasks. It offers a 1M token context window and is designed to be the most affordable option in the Gemini 2.0 Flash family, ideal for scaling AI applications.
Input cost
$0.075 per 1M tokens
Output cost
$0.30 per 1M tokens
Context window
1M tokens
Max output
8192 tokens
Modalities
License
proprietary
High-volume, text-only applications requiring low cost and low latency.