by OpenAI· Released October 2024· Cutoff October 2023
GPT-4o-realtime-preview is a multimodal model from OpenAI designed for low-latency, real-time interactions, supporting text, audio, and vision inputs. It is optimized for voice conversations and live applications, offering near-instantaneous responses. This model is part of the GPT-4o family, combining advanced reasoning with real-time capabilities.
Input cost
$5.00 per 1M tokens
Output cost
$20.00 per 1M tokens
Context window
128K tokens
Max output
4096 tokens
Modalities
License
proprietary
Real-time voice and multimodal applications requiring low latency, such as live assistants, customer support, and interactive agents.