Skip to main content

Devstral Small (May '25)

MistralMistral
Release Date
2025-05-21
Parameters
Context Length
256K
Modalities
image, text

Capability Radar

26
general
26
coding
33
reasoning
29
scienceest.
31
agents
60
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking336
24.0
AA
General Ranking319
34.0
AA
Math Reasoning240
37.0
AA
Science384
29.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA71.2%SR

Code

LiveCodeBench63.6%SR

Creativity

Arena Hard58.3%SR

Finance

MMLU-Pro78.0%SR

General

MMMU-Pro60.0%SR
IFBench48.0%SR

Language

COLLIE62.9%SR

Long Context

AA-LCR71.2%SR

Math

AIME 202583.8%SR

AA Evaluation Indices

Intelligence Index
11.8
Math 500
0.7
Mmlu Pro
0.6
Gpqa
0.4
Tau2
0.4
Ifbench
0.3
Lcr
0.3
Livecodebench
0.3
Scicode
0.2
Aime
0.1
Terminalbench Hard
0.1
Hle
0.0

LLM Stats Category Scores

Legal
80
Math
80
Finance
80
Healthcare
80
Language
70
Long Context
70
Physics
70
Reasoning
70
Biology
70
Chemistry
70
Multimodal
60
General
60
Code
60
Creativity
60
Vision
60
Writing
60
Instruction Following
50

Pricing

Input PriceFree
Output PriceFree
Blended Price (3:1)Free

Speed

Tokens/sec0.0
Time to First Token0.00s
Time to Answer0.00s

Provider Price Ranking

Provider Price Ranking

3 providers

Cheapest: IO.NETMost Expensive: Mistral
ProviderInputOutput
1IO.NETCheapest
$0.05
$0.22
2NanoGPT
$0.06
$0.06
3Mistral
$0.1
$0.3

Compare pricing across different API providers for this model.

External Sources