NVIDIA Models
NVIDIA logo

Llama 3.1 Nemotron 70B Instruct

70B

by NVIDIA

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

Chat with Llama 3.1 Nemotron 70B Instruct
Input Price$1.20/1M tokens
Output Price$1.20/1M tokens
Intelligence13.4
Coding10.8

Specifications

Technical details and pricing.

ProviderNVIDIA
Context Window131,072 tokens
Release DateOct 15, 2024
ModalitiesText

Benchmarks

12 benchmark scores from Artificial Analysis.

GPQA46.5%
MMLU Pro69.0%
HLE4.6%
LiveCodeBench16.9%
MATH 50073.3%
AIME 202511.0%
AIME24.7%
SciCode23.3%
LCR7.0%
IFBench30.7%
Tau223.1%
TerminalBench Hard4.5%

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Frequently Asked Questions

What is Llama 3.1 Nemotron 70B Instruct good for?

Use Llama 3.1 Nemotron 70B Instruct for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Llama 3.1 Nemotron 70B Instruct cost?

Pricing is based on usage. Current rates are $1.20/1M tokens for input and $1.20/1M tokens for output.

Can I try Llama 3.1 Nemotron 70B Instruct for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Llama 3.1 Nemotron 70B Instruct support images or audio?

Llama 3.1 Nemotron 70B Instruct focuses on text-based tasks.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.