Phi-3 Mini Instruct 3.8B

Microsoft / Phi
Release Date: 2024-04-23
Parameters: 3.8B
Context Length: 16K
Modalities: text

Capability Radar

general: 16
coding: 11
reasoning: 11
science (est.): 18
agents: 11
multimodal: 0

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain | Rank | Score | Source
Code Ranking | 472 | 5.0 | AA
General Ranking | 483 | 15.0 | AA
Math Reasoning | 338 | 9.0 | AA
Reasoning | 27 | 83.0 | LS
Science | 466 | 16.0 | AA

Benchmark Scores (LLM Stats)

Biology

GPQA: 56.1% (SR)

Code

HumanEval: 82.6% (SR)

Creativity

Arena Hard: 75.4% (SR)

Factuality

SimpleQA: 3.0% (SR)

Finance

MMLU: 84.8% (SR)
MMLU-Pro: 70.4% (SR)

General

IFEval: 63.0% (SR)
PhiBench: 56.2% (SR)
LiveBench: 47.6% (SR)

Math

MGSM: 80.6% (SR)
MATH: 80.4% (SR)
DROP: 75.5% (SR)

Reasoning

HumanEval+: 82.8% (SR)

AA Evaluation Indices

Intelligence Index: 4.6
MATH-500: 0.5
MMLU-Pro: 0.4
GPQA: 0.3
Math Index: 0.3
IFBench: 0.2
LiveCodeBench: 0.1
SciCode: 0.1
HLE: 0.0
AIME: 0.0
LCR: 0.0
AIME 25: 0.0
TerminalBench Hard: 0.0
Tau2: 0.0

LLM Stats Category Scores

Language: 80
Legal: 80
Finance: 80
Healthcare: 80
Code: 80
Creativity: 80
Writing: 80
Math: 70
Reasoning: 70
Instruction Following: 60
Physics: 60
Structured Output: 60
General: 60
Biology: 60
Chemistry: 60
Factuality: 0

Pricing

Input Price: Free
Output Price: Free
Blended Price (3:1): Free

Speed

Tokens/sec: 0.0
Time to First Token: 0.00s
Time to Answer: 0.00s

Provider Price Ranking

4 providers

Cheapest: Kilo Gateway
Most Expensive: Azure

Rank | Provider | Input | Output
1 | Kilo Gateway (cheapest) | $0.06 | $0.14
2 | OpenRouter | $0.065 | $0.14
3 | Azure Cognitive Services | $0.13 | $0.52
4 | Azure | $0.13 | $0.52

Compare pricing across different API providers for this model.
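
The Pricing section reports a "Blended Price (3:1)" but the page does not define it. A minimal sketch follows, assuming the ratio means a weighted average of three parts input price to one part output price. The function names `blended_price` and `rank_by_blended` are hypothetical, and the prices are copied from the provider listing above, whose units the page does not state.

```python
# Provider prices (input, output) copied from the provider listing.
# Units are not stated on the page, so none are assumed here.
PROVIDER_PRICES = {
    "Kilo Gateway": (0.06, 0.14),
    "OpenRouter": (0.065, 0.14),
    "Azure Cognitive Services": (0.13, 0.52),
    "Azure": (0.13, 0.52),
}


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average of input and output price.

    ASSUMPTION: "3:1" is read as 3 parts input to 1 part output.
    The page itself does not spell out the formula.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def rank_by_blended(prices):
    """Return (provider, blended price) pairs, cheapest first."""
    blended = [(name, blended_price(i, o)) for name, (i, o) in prices.items()]
    return sorted(blended, key=lambda pair: pair[1])


if __name__ == "__main__":
    for name, price in rank_by_blended(PROVIDER_PRICES):
        print(f"{name}: ${price:.4f}")
```

Under that assumption the provider order matches the listing, with Kilo Gateway cheapest and the two Azure entries tied as most expensive.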

External Sources