Phi-4

MicrosoftPhi开源权重MIT · 商用许可

描述

phi-4 is a state-of-the-art open model built to excel at advanced reasoning, coding, and knowledge tasks. It leverages a blend of synthetic data, filtered web data, academic texts, and supervised fine-tuning for precision, alignment, and safety.

发布日期

2024-12-12

参数规模

14.7B

上下文长度

16K

支持模态

text

能力雷达图

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域	#排名	分数	来源
代码能力榜	447	10.0	AA
通用能力榜	431	22.0	AA
数学推理	267	30.0	AA
科学能力	320	35.0	AA

基准测试分数 (LLM Stats)

Biology

GPQA

56.1%自报

Code

HumanEval

82.6%自报

Creativity

Arena Hard

75.4%自报

Factuality

SimpleQA

3.0%自报

Finance

MMLU

84.8%自报

MMLU-Pro

70.4%自报

General

IFEval

63.0%自报

PhiBench

56.2%自报

LiveBench

47.6%自报

Math

MGSM

80.6%自报

MATH

80.4%自报

DROP

75.5%自报

Reasoning

HumanEval+

82.8%自报

AA 评测指数

Math Index

18.0

Intelligence Index

4.9

Math 500

0.8

Mmlu Pro

0.7

Gpqa

0.6

Scicode

0.3

Ifbench

0.2

Livecodebench

0.2

Aime 25

0.2

Aime

0.1

Hle

0.0

Terminalbench Hard

0.0

Lcr

0.0

Tau2

0.0

LLM Stats 分类评分

Language

Legal

Finance

Healthcare

Code

Creativity

Writing

Math

Reasoning

Instruction Following

Physics

Structured Output

General

Biology

Chemistry

Factuality

定价

输入价格$0.125 / 1M tokens

输出价格$0.5 / 1M tokens

混合价格(3:1)$0.219 / 1M tokens

速度

Tokens/秒40.8

首Token延迟0.47s

首回答延迟0.47s

供应商价格排行

3 个供应商

最便宜: Microsoft最贵: Azure

供应商输入输出

1Microsoft主要

$0.125

$0.5

2Azure Cognitive Services

$0.17

$0.68

3Azure

$0.17

$0.68

比较该模型在不同 API 供应商之间的定价。

外部链接

LLM Stats Artificial Analysis