by Alibaba · Released May 2025 · Cutoff April 2025
Qwen3 0.6B is a compact language model from Alibaba's Qwen3 series, designed for lightweight deployment and fast inference. Its low resource requirements make it suitable for edge devices and real-time applications. As the smallest model in the Qwen3 lineup, it offers strong capabilities for its size.
Input cost: Free (open source)
Output cost: Free (open source)
Context window: 32K tokens
Max output: 8192 tokens
Modalities
Parameters: 0.6B
License: Apache-2.0
Lightweight, fast inference tasks on resource-constrained devices or cost-sensitive applications.
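As a rough sanity check on the figures listed here, the sketch below estimates weight memory for 0.6B parameters at a few common precisions, and the prompt budget left after reserving the full 8192-token output. It rests on several assumptions that the listing does not confirm: that "0.6B" means exactly 600 million parameters, that "32K" means 32,768 tokens, and that prompt and output share one context window. The function names are illustrative only, and real memory use will be higher once activations and the KV cache are counted.

```python
# Figures taken from the listing; the exact interpretations are assumptions.
PARAMS = 600_000_000          # "0.6B", assumed to be exactly 600 million
CONTEXT_WINDOW = 32 * 1024    # "32K tokens", assumed to mean 32,768
MAX_OUTPUT = 8192             # "Max output"

# Bytes per parameter for common weight formats (weights only).
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params: int, precision: str) -> float:
    """Approximate weight storage in GB (10^9 bytes), excluding activations and KV cache."""
    return params * BYTES_PER_PARAM[precision] / 1e9


def max_prompt_tokens(context_window: int, reserved_output: int) -> int:
    """Tokens left for the prompt if the output reservation shares the context window."""
    if reserved_output > context_window:
        raise ValueError("reserved output exceeds the context window")
    return context_window - reserved_output


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        print(f"{precision}: ~{weight_memory_gb(PARAMS, precision):.1f} GB of weights")
    print("prompt budget:", max_prompt_tokens(CONTEXT_WINDOW, MAX_OUTPUT), "tokens")
```

Under these assumptions, half-precision weights come to about 1.2 GB, and a prompt can use up to 24,576 tokens when the full output allowance is reserved.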