All Models

NVIDIA AI Models

Builds Nemotron models optimized for NVIDIA hardware. Leading GPU maker powering most AI training.

Founded 1993Santa Clara, CA3 Models Website →
NVIDIA logo

Nemotron Nano 9B V2

NVIDIA

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks.

Context131K
Speed115 tok/s
InputText
OutputText
ReasoningYes
NVIDIA logo

Llama 3.3 Nemotron Super 49B V1.5

NVIDIA

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context.

Context131K
Speed75 tok/s
InputText
OutputText
ReasoningYes
NVIDIA logo

Llama 3.1 Nemotron 70B Instruct

NVIDIA

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses.

Context131K
Speed29 tok/s
InputText
OutputText
ReasoningNo