by Alibaba· Released May 2025· Cutoff April 2025
Qwen3 8B is a 8-billion-parameter large language model from Alibaba's Qwen3 series, featuring a Mixture-of-Experts (MoE) architecture with 2.2B activated parameters. It supports extended thinking (reasoning) and is optimized for efficiency, outperforming similarly sized dense models on benchmarks like MMLU and coding tasks.
Input cost
Free (open source)
Output cost
Free (open source)
Context window
32K tokens
Max output
8192 tokens
Modalities
Parameters
8B (2.2B activated)
License
Apache-2.0
Efficient reasoning and coding tasks with a balance of performance and low computational cost.