by Google· Released May 2025
Gemini 3.1 Flash Live is a low-latency multimodal model from Google optimized for real-time interactions, including live audio and video streaming. It is part of the Gemini 3.1 Flash family, designed for fast, cost-effective performance with a 1M token context window.
Input cost
$0.15 per 1M tokens
Output cost
$0.60 per 1M tokens
Context window
1M tokens
Max output
8192 tokens
Modalities
License
proprietary
Real-time multimodal applications requiring low latency, such as live video analysis and interactive voice assistants.