Phi-3.5-MoE-instruct

MicrosoftPhi開源權重MIT · 商用許可

描述

Phi-3.5-MoE-instruct is a mixture-of-experts model with ~42B total parameters (6.6B active) and a 128K context window. It excels at reasoning, math, coding, and multilingual tasks, outperforming larger dense models in many benchmarks. It underwent a thorough safety post-training process (SFT + DPO) and is licensed under MIT. This model is ideal for scenarios where efficiency and high performance are both required, particularly in multi-lingual or reasoning-intensive tasks.

發布日期

2024-08-23

參數規模

60.0B

上下文長度

—

支援模態

—

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
推理能力	22	84.0	LS

基準測試分數 (LLM Stats)

Biology

GPQA

36.8%自報

Code

RepoQA

85.0%自報

HumanEval

70.7%自報

Creativity

Social IQa

78.0%自報

Arena Hard

37.9%自報

Finance

MMLU

78.9%自報

TruthfulQA

77.5%自報

MMLU-Pro

45.3%自報

General

ARC-C

91.0%自報

OpenBookQA

89.6%自報

PIQA

88.6%自報

MBPP

0.81 / 100自報

MMMLU

69.9%自報

Language

BoolQ

84.6%自報

MEGA XStoryCloze

82.8%自報

Winogrande

81.3%自報

BIG-Bench Hard

79.1%自報

MEGA XCOPA

76.6%自報

MEGA TyDi QA

67.1%自報

MEGA MLQA

65.3%自報

MEGA UDPOS

60.4%自報

SQuALITY

24.1%自報

Long Context

RULER

87.1%自報

Qasper

40.0%自報

GovReport

26.4%自報

QMSum

19.9%自報

SummScreenFD

16.9%自報

Math

GSM8k

88.7%自報

MATH

59.5%自報

MGSM

58.7%自報

Reasoning

HellaSwag

83.8%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Psychology

Language

Legal

Math

Reasoning

Finance

General

Healthcare

Code

Long Context

Physics

Creativity

Biology

Chemistry

Writing

Summarization

定價

暫無定價資料

速度

暫無速度資料

供應商價格排行

2 個供應商

最便宜: Azure Cognitive Services最貴: Azure

供應商輸入輸出

1Azure Cognitive Services最便宜

$0.16

$0.64

2Azure

$0.16

$0.64

比較該模型在不同 API 供應商之間的定價。

外部連結

LLM Stats Artificial Analysis