Skip to main content

Devstral Small (Jul '25)

MistralMistral
Release Date
2025-07-10
Parameters
Context Length
256K
Modalities
image, text

Capability Radar

24
general
25
coding
28
reasoning
28
scienceest.
27
agents
60
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking368
20.0
AA
General Ranking355
31.0
AA
Math Reasoning268
30.0
AA
Science400
27.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA71.2%SR

Code

LiveCodeBench63.6%SR

Creativity

Arena Hard58.3%SR

Finance

MMLU-Pro78.0%SR

General

MMMU-Pro60.0%SR
IFBench48.0%SR

Language

COLLIE62.9%SR

Long Context

AA-LCR71.2%SR

Math

AIME 202583.8%SR

AA Evaluation Indices

Math Index
29.3
Intelligence Index
9.3
Math 500
0.6
Mmlu Pro
0.6
Gpqa
0.4
Ifbench
0.3
Aime 25
0.3
Tau2
0.3
Livecodebench
0.3
Scicode
0.2
Lcr
0.2
Terminalbench Hard
0.1
Hle
0.0
Aime
0.0

LLM Stats Category Scores

Legal
80
Math
80
Finance
80
Healthcare
80
Language
70
Long Context
70
Physics
70
Reasoning
70
Biology
70
Chemistry
70
Multimodal
60
General
60
Code
60
Creativity
60
Vision
60
Writing
60
Instruction Following
50

Pricing

Input Price$0.1 / 1M tokens
Output Price$0.3 / 1M tokens
Blended Price (3:1)$0.15 / 1M tokens

Speed

Tokens/sec55.5
Time to First Token0.57s
Time to Answer0.57s

Provider Price Ranking

Provider Price Ranking

3 providers

Cheapest: MistralMost Expensive: Vercel AI Gateway
ProviderInputOutput
1MistralPRIMARY
$0.1
$0.3
2Kilo Gateway
$0.1
$0.3
3Vercel AI Gateway
$0.1
$0.3

Compare pricing across different API providers for this model.

External Sources