by Google· Released September 2024· Cutoff September 2024
Gemini 1.5 Flash 002 is a fast and efficient multimodal model from Google, optimized for high-volume, low-latency tasks. It balances performance and cost, making it ideal for applications requiring quick responses across text, image, audio, and video inputs. As part of the Gemini 1.5 family, it offers a large context window and competitive pricing.
Input cost
$0.075 per 1M tokens
Output cost
$0.30 per 1M tokens
Context window
1,048,576 tokens
Max output
8192 tokens
Modalities
License
proprietary
High-volume, low-latency applications requiring multimodal understanding and fast responses.