by NVIDIA· Released October 2024· Cutoff December 2023
Llama 3.1 Nemotron Ultra 253B is a large language model developed by NVIDIA, based on Meta's Llama 3.1 architecture with enhancements for improved reasoning and instruction following. It is part of NVIDIA's Nemotron model family, designed for enterprise-grade AI applications requiring high accuracy and reliability.
Input cost
$5.00 per 1M tokens
Output cost
$15.00 per 1M tokens
Context window
128K tokens
Max output
4096 tokens
Modalities
Parameters
253B
License
proprietary
Enterprise applications requiring high-quality reasoning, code generation, and instruction following with a large context window.