Phi-3.5-MoE-instruct
MicrosoftPhi開源權重MIT · 商用許可
描述
Phi-3.5-MoE-instruct is a mixture-of-experts model with ~42B total parameters (6.6B active) and a 128K context window. It excels at reasoning, math, coding, and multilingual tasks, outperforming larger dense models in many benchmarks. It underwent a thorough safety post-training process (SFT + DPO) and is licensed under MIT. This model is ideal for scenarios where efficiency and high performance are both required, particularly in multi-lingual or reasoning-intensive tasks.
發布日期
2024-08-23
參數規模
60.0B
上下文長度
—
支援模態
—
能力雷達圖
70
general
70
coding
70
reasoning
34
science估算
70
agents
0
multimodal
Science 在缺少專門科學評測時使用推理能力代理估算。
排行榜排名
| 領域 | #排名 | 分數 | 來源 |
|---|---|---|---|
| 推理能力 | 22 | 84.0 | LS |
基準測試分數 (LLM Stats)
Biology
GPQA
36.8%自報
Code
RepoQA
85.0%自報
HumanEval
70.7%自報
Creativity
Social IQa
78.0%自報
Arena Hard
37.9%自報
Finance
MMLU
78.9%自報
TruthfulQA
77.5%自報
MMLU-Pro
45.3%自報
General
ARC-C
91.0%自報
OpenBookQA
89.6%自報
PIQA
88.6%自報
MBPP
0.81 / 100自報
MMMLU
69.9%自報
Language
BoolQ
84.6%自報
MEGA XStoryCloze
82.8%自報
Winogrande
81.3%自報
BIG-Bench Hard
79.1%自報
MEGA XCOPA
76.6%自報
MEGA TyDi QA
67.1%自報
MEGA MLQA
65.3%自報
MEGA UDPOS
60.4%自報
SQuALITY
24.1%自報
Long Context
RULER
87.1%自報
Qasper
40.0%自報
GovReport
26.4%自報
QMSum
19.9%自報
SummScreenFD
16.9%自報
Math
GSM8k
88.7%自報
MATH
59.5%自報
MGSM
58.7%自報
Reasoning
HellaSwag
83.8%自報
AA 評測指數
暫無 AA 評測資料
LLM Stats 分類評分
Psychology80
Language70
Legal70
Math70
Reasoning70
Finance70
General70
Healthcare70
Code70
Long Context60
Physics60
Creativity60
Biology40
Chemistry40
Writing40
Summarization20
定價
暫無定價資料
速度
暫無速度資料
供應商價格排行
供應商價格排行
2 個供應商
最便宜: Azure Cognitive Services最貴: Azure
供應商輸入輸出
1Azure Cognitive Services最便宜
$0.16
$0.64
2Azure
$0.16
$0.64
比較該模型在不同 API 供應商之間的定價。