by Meta· Released September 2024· Cutoff August 2024
Llama 3.2 11B is a multimodal model that supports text and image inputs, enabling tasks like visual reasoning and document understanding. It is part of Meta's Llama 3.2 family, offering a balance of performance and efficiency for on-device and cloud applications. This model is open-source and optimized for instruction following and safety.
Input cost
Free (open source)
Output cost
Free (open source)
Context window
128K tokens
Max output
4096 tokens
Modalities
Parameters
11B
License
Llama 3.2 Community License
Multimodal tasks requiring visual reasoning and text generation with a compact, efficient model.