Skip to main content

Devstral 2

MistralMistral
Release Date
2025-12-09
Parameters
Context Length
262K
Modalities
text

Capability Radar

32
general
42
coding
40
reasoning
39
scienceest.
41
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking242
40.0
AA
General Ranking268
39.0
AA
Math Reasoning233
37.0
AA
Science282
40.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA71.2%SR

Code

LiveCodeBench63.6%SR

Creativity

Arena Hard58.3%SR

Finance

MMLU-Pro78.0%SR

General

MMMU-Pro60.0%SR
IFBench48.0%SR

Language

COLLIE62.9%SR

Long Context

AA-LCR71.2%SR

Math

AIME 202583.8%SR

AA Evaluation Indices

Math Index
36.7
Intelligence Index
15.5
Mmlu Pro
0.8
Gpqa
0.6
Livecodebench
0.4
Ifbench
0.4
Aime 25
0.4
Scicode
0.3
Lcr
0.3
Tau2
0.2
Terminalbench Hard
0.2
Hle
0.0

LLM Stats Category Scores

Legal
80
Math
80
Finance
80
Healthcare
80
Language
70
Long Context
70
Physics
70
Reasoning
70
Biology
70
Chemistry
70
Multimodal
60
General
60
Code
60
Creativity
60
Vision
60
Writing
60
Instruction Following
50

Pricing

Input PriceFree
Output PriceFree
Blended Price (3:1)Free

Speed

Tokens/sec70.3
Time to First Token0.71s
Time to Answer0.71s

Provider Price Ranking

Provider Price Ranking

9 providers

Cheapest: ScalewayMost Expensive: Merge Gateway
ProviderInputOutput
1ScalewayCheapest
$0.4
$2
2NanoGPT
$0.4
$1.4
3OpenRouter
$0.4
$2
4Kilo Gateway
$0.4
$2
5Amazon Bedrock
$0.4
$2
6Mistral
$0.4
$2
7Vercel AI Gateway
$0.4
$2
8LLM Gateway
$0.4
$2
9Merge Gateway
$0.4
$2

Compare pricing across different API providers for this model.

External Sources