
DeepHermes 3 - Mistral 24B Preview (Non-reasoning)

Nous Research / Mistral
Release Date: 2025-03-13
Parameters
Context Length
Modalities

Capability Radar

general: 21
coding: 20
reasoning: 28
science (est.): 26
agents: 26
multimodal: 60

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain            Rank   Score   Source
Code Ranking      353    21.0    AA
General Ranking   408    25.0    AA
Math Reasoning    263    31.0    AA
Science           404    26.0    AA

Benchmark Scores (LLM Stats)

Biology

GPQA: 71.2% (SR)

Code

LiveCodeBench: 63.6% (SR)

Creativity

Arena Hard: 58.3% (SR)

Finance

MMLU-Pro: 78.0% (SR)

General

MMMU-Pro: 60.0% (SR)
IFBench: 48.0% (SR)

Language

COLLIE: 62.9% (SR)

Long Context

AA-LCR: 71.2% (SR)

Math

AIME 2025: 83.8% (SR)

AA Evaluation Indices

Intelligence Index: 5.3
MATH-500: 0.6
MMLU-Pro: 0.6
GPQA: 0.4
SciCode: 0.2
LiveCodeBench: 0.2
AIME: 0.0
HLE: 0.0

LLM Stats Category Scores

Legal: 80
Math: 80
Finance: 80
Healthcare: 80
Language: 70
Long Context: 70
Physics: 70
Reasoning: 70
Biology: 70
Chemistry: 70
Multimodal: 60
General: 60
Code: 60
Creativity: 60
Vision: 60
Writing: 60
Instruction Following: 50

Pricing

Input Price: Free
Output Price: Free
Blended Price (3:1): Free

Speed

Tokens/sec: 0.0
Time to First Token: 0.00s
Time to Answer: 0.00s

Provider Price Ranking

2 providers

Cheapest: Chutes
Most Expensive: NanoGPT

Rank   Provider   Input     Output
1      Chutes     $0.0245   $0.0978
2      NanoGPT    $0.3      $0.3

Compare pricing across different API providers for this model.
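
The Pricing section quotes a "Blended Price (3:1)" but does not say how it is computed. As a minimal sketch, assuming the ratio means a weighted average of three parts input price to one part output price, the same blend can be worked out for the two provider prices listed above. The function name `blended_price` and the weighting interpretation are assumptions for illustration; the page does not state the price units, so the results are in whatever units the listed prices use.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    Assumption: "3:1" means input tokens are weighted 3x relative to output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Provider prices copied from the ranking above (units not stated on the page).
providers = {
    "Chutes": (0.0245, 0.0978),
    "NanoGPT": (0.3, 0.3),
}

for name, (input_price, output_price) in providers.items():
    print(f"{name}: {blended_price(input_price, output_price):.4f}")
```

Under this assumption Chutes blends to roughly 0.0428 and NanoGPT stays at 0.3, since its input and output prices are equal.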
