Llama 3.1 Nemotron Instruct 70B
NVIDIA · Llama · Open Weight · Llama 3.1 Community License
Description
A large language model customized by NVIDIA to improve the helpfulness of LLM-generated responses. It is a fine-tuned version of Llama 3.1 70B Instruct, trained using RLHF (REINFORCE) with HelpSteer2-Preference prompts.
Release Date
2024-10-15
Parameters
70.0B
Context Length
—
Modalities
—
Capability Radar
| Capability | Score |
|---|---|
| general | 26 |
| coding | 18 |
| reasoning | 27 |
| science (est.) | 30 |
| agents | 24 |
| multimodal | 0 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 436 | 11.0 | AA |
| General Ranking | 378 | 29.0 | AA |
| Math Reasoning | 282 | 26.0 | AA |
| Reasoning | 18 | 86.0 | LS |
| Science | 373 | 30.0 | AA |
Benchmark Scores (LLM Stats)
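| Category | Benchmark | Score | Tag |
|---|---|---|---|
| Communication | MT-Bench | 0.09 / 100 | SR |
| Finance | MMLU Chat | 80.6% | SR |
| Finance | MMLU | 80.2% | SR |
| Finance | TruthfulQA | 58.6% | SR |
| General | Instruct HumanEval | 73.8% | SR |
| General | ARC-C | 69.2% | SR |
| Language | Winogrande | 84.5% | SR |
| Language | XLSum English | 31.6% | SR |
| Math | GSM8k | 91.4% | SR |
| Math | GSM8K Chat | 81.9% | SR |
| Reasoning | HellaSwag | 85.6% | SR |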
AA Evaluation Indices
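| Index | Score |
|---|---|
| Math Index | 11.0 |
| Intelligence Index | 7.6 |
| Math 500 | 0.7 |
| MMLU Pro | 0.7 |
| GPQA | 0.5 |
| IFBench | 0.3 |
| AIME | 0.2 |
| SciCode | 0.2 |
| Tau2 | 0.2 |
| LiveCodeBench | 0.2 |
| AIME 25 | 0.1 |
| LCR | 0.1 |
| HLE | 0.0 |
| TerminalBench Hard | 0.0 |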
LLM Stats Category Scores
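| Category | Score |
|---|---|
| Math | 90 |
| Language | 80 |
| Legal | 70 |
| Reasoning | 70 |
| Finance | 70 |
| Healthcare | 70 |
| General | 50 |
| Roleplay | 10 |
| Communication | 10 |
| Creativity | 10 |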
Pricing
Input Price: $1.2 / 1M tokens
Output Price: $1.2 / 1M tokens
Blended Price (3:1): $1.2 / 1M tokens
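
The blended figure is consistent with a weighted average of the input and output prices. The sketch below assumes "3:1" means three input tokens for every output token; the page does not define the ratio, and `blended_price` is a hypothetical helper name.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumption: "3:1" means 3 parts input tokens to 1 part output tokens.
    """
    total = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total


# Listed prices for this model, in USD per 1M tokens.
print(round(blended_price(1.2, 1.2), 4))
```

Because the listed input and output prices are equal, any weighting gives the same blended value; the ratio only matters for providers that price the two directions differently.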
Speed
Tokens/sec: 295.6
Time to First Token: 4.91s
Time to Answer: 4.91s
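
These two figures can be combined into a rough estimate of how long a complete response takes. The sketch below assumes tokens stream at the listed rate once the first token arrives, which is a simplification; `estimate_latency` and the 500-token response length are hypothetical.

```python
def estimate_latency(ttft_s: float, tokens_per_s: float, output_tokens: int) -> float:
    """Rough seconds to receive a full response.

    Assumption: tokens stream at a constant rate once the first token arrives.
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_s + output_tokens / tokens_per_s


# Listed figures: 4.91 s time to first token, 295.6 tokens/sec.
# The 500-token response length is an illustrative assumption.
print(f"{estimate_latency(4.91, 295.6, 500):.2f} s")
```

With these numbers the time to first token dominates, so a shorter response would save little wall-clock time under this model.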
Provider Price Ranking
2 providers
Cheapest: NanoGPT
Most Expensive: NVIDIA
| # | Provider | Input | Output | Note |
|---|---|---|---|---|
| 1 | NanoGPT | $0.357 | $0.408 | Cheapest |
| 2 | NVIDIA | $1.2 | $1.2 | Primary |
Compare pricing across different API providers for this model.
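
To compare providers for a concrete workload, the listed prices can be turned into a per-request cost. This is a minimal sketch that assumes all provider prices are quoted in USD per 1M tokens, as the model's own pricing is; the function names and the 3,000/1,000 token workload are hypothetical.

```python
# Listed prices, assumed to be USD per 1M tokens: (input, output).
PROVIDERS = {
    "NanoGPT": (0.357, 0.408),
    "NVIDIA": (1.2, 1.2),
}


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD for one request, given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest_provider(input_tokens: int, output_tokens: int) -> str:
    """Name of the provider with the lowest cost for this token mix."""
    return min(
        PROVIDERS,
        key=lambda name: request_cost(input_tokens, output_tokens, *PROVIDERS[name]),
    )


# Hypothetical workload: 3,000 input tokens and 1,000 output tokens.
for name, (inp, out) in PROVIDERS.items():
    print(name, f"${request_cost(3000, 1000, inp, out):.6f}")
print("cheapest:", cheapest_provider(3000, 1000))
```

Price is only one axis; the page lists speed figures for the model without breaking them down by provider, so this sketch does not account for latency differences.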