by Google· Released May 2025
Gemini 2.5 Pro TTS Preview is a multimodal model from Google that adds text-to-speech (TTS) capabilities to the Gemini 2.5 Pro reasoning model. It can generate spoken audio responses from text, enabling natural voice interactions. This preview model is designed for applications requiring high-quality, expressive speech synthesis.
Input cost
$1.25 per 1M tokens
Output cost
$10.00 per 1M tokens
Context window
1M tokens
Max output
8192 tokens
Modalities
License
proprietary
Generating natural, expressive speech from text for conversational AI and voice applications.