by Mistral AI · Released December 2023 · Cutoff December 2023
Mixtral 8x7B is a sparse mixture-of-experts (MoE) model that keeps inference latency low by activating only 12.9B of its 46.7B parameters per token. It is Mistral's first open-weight MoE model, with strong multilingual capabilities and a 32K-token context window.
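To make "activating only a subset of parameters per token" concrete, here is a minimal toy sketch of top-k expert routing in plain Python. It is not Mistral's implementation: the expert count of 8 is read off the model name, the choice of 2 experts per token is an assumption not stated on this page, and the softmax-over-selected-logits weighting is one common formulation. The names `route_top_k` and `moe_layer` are hypothetical, and the "experts" are trivial scaling functions standing in for feed-forward blocks.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_logits, k=2):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))


def moe_layer(x, experts, router_logits, k=2):
    """Weighted sum of the outputs of the selected experts only.

    Experts that are not selected are never called, which is where
    the compute saving of a sparse MoE layer comes from.
    """
    return sum(w * experts[i](x) for i, w in route_top_k(router_logits, k))


# Toy setup (assumption): 8 experts, each just scales a scalar input.
NUM_EXPERTS = 8
experts = [lambda x, s=i + 1: s * x for i in range(NUM_EXPERTS)]

rng = random.Random(0)
# Stand-in for the output of a learned router for one token.
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

selected = route_top_k(logits, k=2)
output = moe_layer(1.0, experts, logits, k=2)
print(selected, output)
```

Only the two selected experts run for a given token, so per-token compute tracks the active parameter count rather than the total.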
Input cost: Free (open source)
Output cost: Free (open source)
Context window: 32K tokens
Max output: —
Modalities: (not listed)
Parameters: 46.7B total (12.9B active per token)
License: Apache-2.0
Well suited to high-quality text generation and code tasks, with efficient inference thanks to its MoE architecture.
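The two parameter counts above also hint at why an "8x7B" model totals 46.7B rather than 56B. The back-of-the-envelope sketch below assumes 8 experts with 2 routed per token (neither figure is stated on this page) and assumes every non-expert parameter, such as attention and embeddings, is shared by all tokens. Under those assumptions the two counts give two linear equations that can be solved for the per-expert and shared sizes.

```python
TOTAL_B = 46.7   # total parameters, in billions (from the card)
ACTIVE_B = 12.9  # parameters active per token, in billions (from the card)
NUM_EXPERTS = 8  # assumption: read off the "8x" in the model name
TOP_K = 2        # assumption: experts routed per token

# Simplified model of the parameter budget:
#   total  = shared + NUM_EXPERTS * per_expert
#   active = shared + TOP_K       * per_expert
# Subtracting the second equation from the first isolates per_expert.
per_expert_b = (TOTAL_B - ACTIVE_B) / (NUM_EXPERTS - TOP_K)
shared_b = ACTIVE_B - TOP_K * per_expert_b
active_fraction = ACTIVE_B / TOTAL_B

print(f"per-expert ~ {per_expert_b:.2f}B, "
      f"shared ~ {shared_b:.2f}B, "
      f"active fraction ~ {active_fraction:.1%}")
```

Under these assumptions each expert comes to roughly 5.6B parameters and the shared portion to roughly 1.6B, so a little over a quarter of the weights are used for any single token. All 46.7B still have to be held in memory, since different tokens can be routed to different experts.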