by Google· Released May 2025
Gemini 2.5 Flash TTS Preview is a text-to-speech model that generates natural-sounding speech from text input. It is part of the Gemini 2.5 Flash family, optimized for low-latency, high-quality audio generation. This preview model allows developers to integrate expressive speech synthesis into applications.
Input cost
—
Output cost
—
Context window
—
Max output
—
Modalities
License
proprietary
Generating natural-sounding speech from text for real-time applications.