by Alibaba · Released June 2024 · Cutoff March 2024
Qwen2 1.5B is a small, efficient language model from Alibaba's Qwen2 series, designed for lightweight deployment and fast inference. It trades some capability for low resource usage, which makes it suitable for edge devices and applications with limited compute. As one of the smaller models in the Qwen2 family (only the 0.5B variant is smaller), it offers strong multilingual capabilities and supports a 32K token context window.
Input cost
Free (open source)
Output cost
Free (open source)
Context window
32K tokens
Max output
8192 tokens
Modalities
Text
Parameters
1.5B
License
Apache-2.0
Best suited to lightweight, fast inference on resource-constrained devices and to applications that require low latency.
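To make the figures above concrete, here is a rough sizing sketch in Python. It estimates the memory needed for the weights alone at a few storage precisions, and the prompt budget left once the maximum output is reserved. Several things here are assumptions rather than published facts: the parameter count is taken as a nominal 1.5e9, "32K" is read as 32,768 tokens, output tokens are assumed to share the context window with the prompt, and the bytes-per-parameter table ignores quantization metadata, the KV cache, and activations. The function names are hypothetical and used only for illustration.

```python
# Rough sizing sketch for a 1.5B-parameter model. All constants are assumptions
# derived from the spec sheet, not measured values.

PARAMS = 1.5e9               # nominal parameter count ("1.5B")
CONTEXT_TOKENS = 32 * 1024   # "32K" read as 32,768 tokens (assumption)
MAX_OUTPUT_TOKENS = 8192     # max output from the spec sheet

# Bytes per weight for common storage formats (ignores quantization metadata).
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(params: float, precision: str) -> float:
    """Approximate memory for weights only, in GiB (no KV cache or activations)."""
    return params * BYTES_PER_PARAM[precision] / 2**30


def max_prompt_tokens(context: int, reserved_output: int) -> int:
    """Tokens left for the prompt, assuming output shares the context window."""
    if reserved_output > context:
        raise ValueError("reserved output exceeds the context window")
    return context - reserved_output


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        print(f"{precision}: ~{weight_memory_gib(PARAMS, precision):.2f} GiB")
    print("prompt budget:", max_prompt_tokens(CONTEXT_TOKENS, MAX_OUTPUT_TOKENS))
```

Under these assumptions, half-precision weights come to a little under 3 GiB, and reserving the full 8192 output tokens leaves 24,576 tokens for the prompt. Actual memory use on a device will be higher once the KV cache and runtime overhead are included.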