Phi-4

MicrosoftPhi開源權重MIT · 商用許可

描述

phi-4 is a state-of-the-art open model built to excel at advanced reasoning, coding, and knowledge tasks. It leverages a blend of synthetic data, filtered web data, academic texts, and supervised fine-tuning for precision, alignment, and safety.

發布日期

2024-12-12

參數規模

14.7B

上下文長度

16K

支援模態

text

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
程式碼能力榜	447	10.0	AA
通用能力榜	431	22.0	AA
數學推理	267	30.0	AA
科學能力	320	35.0	AA

基準測試分數 (LLM Stats)

Biology

GPQA

56.1%自報

Code

HumanEval

82.6%自報

Creativity

Arena Hard

75.4%自報

Factuality

SimpleQA

3.0%自報

Finance

MMLU

84.8%自報

MMLU-Pro

70.4%自報

General

IFEval

63.0%自報

PhiBench

56.2%自報

LiveBench

47.6%自報

Math

MGSM

80.6%自報

MATH

80.4%自報

DROP

75.5%自報

Reasoning

HumanEval+

82.8%自報

AA 評測指數

Math Index

18.0

Intelligence Index

4.9

Math 500

0.8

Mmlu Pro

0.7

Gpqa

0.6

Scicode

0.3

Ifbench

0.2

Livecodebench

0.2

Aime 25

0.2

Aime

0.1

Hle

0.0

Terminalbench Hard

0.0

Lcr

0.0

Tau2

0.0

LLM Stats 分類評分

Language

Legal

Finance

Healthcare

Code

Creativity

Writing

Math

Reasoning

Instruction Following

Physics

Structured Output

General

Biology

Chemistry

Factuality

定價

輸入價格$0.125 / 1M tokens

輸出價格$0.5 / 1M tokens

混合價格(3:1)$0.219 / 1M tokens

速度

Tokens/秒40.8

首Token延遲0.47s

首回答延遲0.47s

供應商價格排行

3 個供應商

最便宜: Microsoft最貴: Azure

供應商輸入輸出

1Microsoft主要

$0.125

$0.5

2Azure Cognitive Services

$0.17

$0.68

3Azure

$0.17

$0.68

比較該模型在不同 API 供應商之間的定價。

外部連結

LLM Stats Artificial Analysis