by OpenAI· Released October 2024· Cutoff October 2023
GPT-4o-mini-realtime-preview is a cost-efficient, low-latency multimodal model optimized for real-time voice and text interactions. It supports audio streaming and function calling, making it ideal for conversational AI applications. As a smaller variant of GPT-4o, it balances performance and affordability for production use.
Input cost
$0.60 per 1M tokens
Output cost
$2.40 per 1M tokens
Context window
128K tokens
Max output
4096 tokens
Modalities
License
proprietary
Real-time voice and text conversational AI applications requiring low latency and cost efficiency.