All Models
NVIDIA AI Models
Builds Nemotron models optimized for NVIDIA hardware. Leading GPU maker powering most AI training.
Nemotron Nano 9B V2
NVIDIA
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks.
Context131K
Speed115 tok/s
InputText
OutputText
ReasoningYes
Llama 3.3 Nemotron Super 49B V1.5
NVIDIA
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context.
Context131K
Speed75 tok/s
InputText
OutputText
ReasoningYes
Llama 3.1 Nemotron 70B Instruct
NVIDIA
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses.
Context131K
Speed29 tok/s
InputText
OutputText
ReasoningNo