by OpenAI· Released April 2025· Cutoff June 2024
GPT-4.1 Mini is a cost-efficient, low-latency model in the GPT-4.1 family, designed for fast and affordable inference while maintaining strong reasoning capabilities. It offers a 1M token context window and supports multimodal inputs (text and images), making it suitable for high-volume, real-time applications.
Input cost
$0.40 per 1M tokens
Output cost
$1.60 per 1M tokens
Context window
1M tokens
Max output
32768 tokens
Modalities
License
proprietary
High-volume, real-time applications requiring low latency and cost efficiency with strong reasoning and long context support.