Llama 3.1 Nemotron 70B Instruct

Name: Llama 3.1 Nemotron 70B Instruct
Brand: NVIDIA
Price: 1.2 USD

70B

by NVIDIA

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

Chat with Llama 3.1 Nemotron 70B Instruct

Input Price$1.20/1M tokens

Output Price$1.20/1M tokens

Intelligence13.4

Coding10.8

Specifications

Technical details and pricing.

ProviderNVIDIA

Context Window131,072 tokens

Release DateOct 15, 2024

ModalitiesText

Benchmarks

12 benchmark scores from Artificial Analysis.

GPQA46.5%

MMLU Pro69.0%

HLE4.6%

LiveCodeBench16.9%

MATH 50073.3%

AIME 202511.0%

AIME24.7%

SciCode23.3%

LCR7.0%

IFBench30.7%

Tau223.1%

TerminalBench Hard4.5%

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Frequently Asked Questions

What is Llama 3.1 Nemotron 70B Instruct good for?

Use Llama 3.1 Nemotron 70B Instruct for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Llama 3.1 Nemotron 70B Instruct cost?

Pricing is based on usage. Current rates are $1.20/1M tokens for input and $1.20/1M tokens for output.

Can I try Llama 3.1 Nemotron 70B Instruct for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Llama 3.1 Nemotron 70B Instruct support images or audio?

Llama 3.1 Nemotron 70B Instruct focuses on text-based tasks.

Similar Models

Other models you might want to explore.

Llama 3.3 Nemotron Super 49B V1.5

NVIDIA

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context.

Details →

Nemotron 3 Nano 30B A3B (free)

NVIDIA

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems.

Details →

Nemotron Nano 9B V2

NVIDIA

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks.

Details →

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.