Llama 3.1 Nemotron Ultra 253B

[MODEL]

Llama 3.1 Nemotron Ultra 253B

id: nvidia/llama-3.1-nemotron-ultra-253b
family: nemotron
context: 128,000
modalities: text → text
release: 2025-04-07
reasoning tools vision structured temp open weights

Offerings

Provider Input Output Context Max out
LLM Gateway
llama-3.1-nemotron-ultra-253b
$0.60/M $1.80/M 128,000 8,192

Similar matches

These provider models look related but are not exact matches.

Provider Input Output Context Max out
Baseten
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B
$0.60/M $2.40/M 202,800 202,800
Nebius Token Factory
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
$0.60/M $1.80/M 128,000 4,096