by Mistral AI· Released November 2024· Cutoff August 2024
Pixtral Large is Mistral AI's most advanced multimodal model, combining a 124 billion parameter decoder with a dedicated vision encoder. It excels at understanding text, images, and documents, and is designed for complex reasoning tasks that require both visual and textual understanding.
Input cost
$2.00 per 1M tokens
Output cost
$6.00 per 1M tokens
Context window
128K tokens
Max output
—
Modalities
Parameters
124B
License
proprietary
Complex multimodal reasoning tasks involving images, documents, and text, such as chart analysis, document QA, and visual question answering.