Llama 3.1 Nemotron Instruct 70B
NVIDIALlamaOpen WeightLlama 3.1 Community License
Description
A large language model customized by NVIDIA to improve the helpfulness of LLM generated responses. It is a fine-tuned version of Llama 3.1 70B Instruct. The model was trained using RLHF (REINFORCE) with HelpSteer2-Preference prompts.
Release Date
2024-10-15
Parameters
70.0B
Context Length
131K
Modalities
text
Capability Radar
29
general
14
coding
27
reasoning
30
scienceest.
0
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 391 | 14.0 | AA |
| General Ranking | 355 | 31.0 | AA |
| Math Reasoning | 282 | 26.0 | AA |
| Reasoning | 18 | 86.0 | LS |
| Science | 346 | 31.0 | AA |
Benchmark Scores (LLM Stats)
Communication
MT-Bench
0.09 / 100SR
Finance
MMLU Chat
80.6%SR
MMLU
80.2%SR
TruthfulQA
58.6%SR
General
Instruct HumanEval
73.8%SR
ARC-C
69.2%SR
Language
Winogrande
84.5%SR
XLSum English
31.6%SR
Math
GSM8k
91.4%SR
GSM8K Chat
81.9%SR
Reasoning
HellaSwag
85.6%SR
AA Evaluation Indices
Intelligence Index13.4
Math Index11.0
Coding Index10.8
Math 5000.7
Mmlu Pro0.7
Gpqa0.5
Ifbench0.3
Aime0.2
Scicode0.2
Tau20.2
Livecodebench0.2
Aime 250.1
Lcr0.1
Hle0.0
Terminalbench Hard0.0
LLM Stats Category Scores
Math90
Language80
Finance70
Healthcare70
Legal70
Reasoning70
General50
Communication10
Creativity10
Roleplay10
Pricing
Input Price$1.2 / 1M tokens
Output Price$1.2 / 1M tokens
Blended Price (3:1)$1.2 / 1M tokens
Speed
Tokens/sec38.1 tokens/s
Time to First Token0.34s
Time to Answer0.34s
Available Providers
(LS internal units)No provider data available