Phi-4 Mini Instruct

MicrosoftPhiOpen WeightMIT · Commercial OK

Description

Phi 4 Mini Instruct is a lightweight (3.8B parameters) open model built upon synthetic data and filtered web data, focusing on high-quality reasoning. It supports a 128K token context length and is enhanced for instruction adherence and safety via supervised fine-tuning and direct preference optimization.

Release Date

2024-02-26

Parameters

3.8B

Context Length

128K

Modalities

text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	435	11.0	AA
General Ranking	481	15.0	AA
Math Reasoning	310	18.0	AA
Reasoning	54	69.0	LS
Science	452	17.0	AA

Benchmark Scores (LLM Stats)

Biology

GPQA

25.2%SR

Creativity

Social IQa

72.5%SR

Arena Hard

32.8%SR

Finance

MMLU

67.3%SR

TruthfulQA

66.4%SR

MMLU-Pro

52.8%SR

General

ARC-C

83.7%SR

OpenBookQA

79.2%SR

PIQA

77.6%SR

Multilingual MMLU

49.3%SR

Language

BoolQ

81.2%SR

BIG-Bench Hard

70.4%SR

Winogrande

67.0%SR

Math

GSM8k

88.6%SR

MATH

64.0%SR

MGSM

63.9%SR

Reasoning

HellaSwag

69.1%SR

AA Evaluation Indices

Math Index

6.7

Intelligence Index

3.0

Math 500

0.7

Mmlu Pro

0.5

Gpqa

0.3

Ifbench

0.2

Lcr

0.1

Livecodebench

0.1

Scicode

0.1

Tau2

0.1

Aime 25

0.1

Hle

0.0

Aime

0.0

Terminalbench Hard

0.0

LLM Stats Category Scores

Math

Psychology

Reasoning

Language

Legal

Finance

General

Healthcare

Physics

Creativity

Biology

Chemistry

Writing

Pricing

Input PriceFree

Output PriceFree

Blended Price (3:1)Free

Cache Read Price$0.08 / 1M tokens

Speed

Tokens/sec46.4

Time to First Token1.21s

Time to Answer1.21s

Provider Price Ranking

4 providers

Cheapest: Azure Cognitive ServicesMost Expensive: NanoGPT

ProviderInputOutput

1Azure Cognitive ServicesCheapest

$0.075

$0.3

2Azure

$0.075

$0.3

3Weights & Biases

$0.08

$0.35

4NanoGPT

$0.17

$0.68

Compare pricing across different API providers for this model.

External Sources

LLM Stats Artificial Analysis