Hermes 3 - Llama-3.1 70B
Nous ResearchLlamaOpen WeightApache 2.0 · Commercial OK
Description
Hermes 3 70B is Nous Research's flagship instruction-following model, fine-tuned for advanced reasoning, creative writing, and complex task completion. It features exceptional instruction adherence and strong performance across multiple domains.
Release Date
2024-08-15
Parameters
70.0B
Context Length
131K
Modalities
text
Capability Radar
21
general
20
coding
25
reasoning
27
scienceest.
24
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 371 | 20.0 | AA |
| General Ranking | 413 | 25.0 | AA |
| Math Reasoning | 279 | 27.0 | AA |
| Reasoning | 48 | 70.0 | LS |
| Science | 401 | 27.0 | AA |
Benchmark Scores (LLM Stats)
Biology
GPQA
66.1%SR
Communication
MT-Bench
8.99 / 100SR
Finance
MMLU
79.1%SR
TruthfulQA
63.3%SR
MMLU-Pro
47.2%SR
General
PIQA
84.4%SR
ARC-E
83.0%SR
IFBench
81.2%SR
ARC-C
65.5%SR
AGIEval
56.2%SR
OpenBookQA
49.4%SR
Language
BoolQ
88.0%SR
Winogrande
83.2%SR
BBH
67.8%SR
Math
MATH
20.8%SR
Reasoning
HellaSwag
88.2%SR
MuSR
50.7%SR
AA Evaluation Indices
Intelligence Index5.1
Mmlu Pro0.6
Math 5000.5
Gpqa0.4
Scicode0.2
Livecodebench0.2
Hle0.0
Aime0.0
LLM Stats Category Scores
Roleplay9
Communication9
Creativity9
General1
Reasoning1
Instruction Following80
Physics80
Language70
Biology70
Chemistry70
Legal60
Finance60
Healthcare60
Math50
Pricing
Input Price$0.3 / 1M tokens
Output Price$0.3 / 1M tokens
Blended Price (3:1)$0.3 / 1M tokens
Speed
Tokens/sec30.1
Time to First Token0.35s
Time to Answer0.35s
Provider Price Ranking
Provider Price Ranking
4 providers
Cheapest: Nous ResearchMost Expensive: OpenRouter
ProviderInputOutput
1Nous ResearchPRIMARY
$0.3
$0.3
2Kilo Gateway
$0.3
$0.3
3NanoGPT
$0.408
$0.408
4OpenRouter
$0.7
$0.7
Compare pricing across different API providers for this model.