Phi-3.5-MoE-instruct

Microsoft · Phi · Open Weight · MIT · Commercial Use

Description

Phi-3.5-MoE-instruct is a mixture-of-experts model with ~42B total parameters (6.6B active) and a 128K-token context window. It performs well on reasoning, math, coding, and multilingual tasks, and outperforms larger dense models on many benchmarks. It went through safety post-training (SFT + DPO) and is released under the MIT license. It suits workloads that need both efficiency and strong performance, particularly multilingual or reasoning-intensive ones.
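To make the total-versus-active parameter split concrete, the following is a rough sketch of the arithmetic. The parameter counts are the approximate figures from the description; the bytes-per-parameter table and the function names are illustrative assumptions, and the estimate covers weights only (no KV cache or activations).

```python
# Approximate figures taken from the description above.
TOTAL_PARAMS = 42e9    # ~42B total parameters
ACTIVE_PARAMS = 6.6e9  # ~6.6B active parameters

# Assumed storage widths in bytes per parameter (illustrative, not from the source).
BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "int8": 1, "int4": 0.5}


def active_fraction(total: float, active: float) -> float:
    """Share of the parameters that are active, as a value between 0 and 1."""
    return active / total


def weight_memory_gb(params: float, dtype: str) -> float:
    """Weight-only memory in GB (1 GB = 1e9 bytes); ignores KV cache and activations."""
    return params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    print(f"active fraction: {active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS):.1%}")
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_memory_gb(TOTAL_PARAMS, dtype):.0f} GB")
```

Under these assumptions, compute per token scales with the active count, while memory typically scales with the total count, since all experts usually need to be resident.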

Release date

2024-08-23

Parameters

60.0B

Context length

—

Modalities

—

Capability radar

Axes: general, coding, reasoning, science (est.), agents, multimodal.

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	Rank	Score	Source
Reasoning	22	84.0	LS

Benchmark scores (LLM Stats)

Category	Benchmark	Score (self-reported)
Biology	GPQA	36.8%
Code	RepoQA	85.0%
Code	HumanEval	70.7%
Creativity	Social IQa	78.0%
Creativity	Arena Hard	37.9%
Finance	MMLU	78.9%
Finance	TruthfulQA	77.5%
Finance	MMLU-Pro	45.3%
General	ARC-C	91.0%
General	OpenBookQA	89.6%
General	PIQA	88.6%
General	MBPP	0.81 / 100
General	MMMLU	69.9%
Language	BoolQ	84.6%
Language	MEGA XStoryCloze	82.8%
Language	Winogrande	81.3%
Language	BIG-Bench Hard	79.1%
Language	MEGA XCOPA	76.6%
Language	MEGA TyDi QA	67.1%
Language	MEGA MLQA	65.3%
Language	MEGA UDPOS	60.4%
Language	SQuALITY	24.1%
Long Context	RULER	87.1%
Long Context	Qasper	40.0%
Long Context	GovReport	26.4%
Long Context	QMSum	19.9%
Long Context	SummScreenFD	16.9%
Math	GSM8k	88.7%
Math	MATH	59.5%
Math	MGSM	58.7%
Reasoning	HellaSwag	83.8%

AA evaluation indices

No AA evaluation data available

LLM Stats category scores

Categories: Psychology, Language, Legal, Math, Reasoning, Finance, General, Healthcare, Code, Long Context, Physics, Creativity, Biology, Chemistry, Writing, Summarization.

Pricing

No pricing data available

Speed

No speed data available

Price ranking by provider

2 providers

Cheapest: Azure Cognitive Services · Most expensive: Azure

#	Provider	Input	Output
1	Azure Cognitive Services (cheapest)	$0.16	$0.64
2	Azure	$0.16	$0.64

Compare prices across API providers for this model.
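As a rough illustration of how the listed input and output prices could translate into a request cost, here is a minimal sketch. The page does not state the pricing unit; the code assumes USD per 1 million tokens, which is a common convention but is not confirmed by the source. The function name `estimate_cost` is hypothetical.

```python
# Prices as listed for both providers above.
INPUT_PRICE_USD = 0.16
OUTPUT_PRICE_USD = 0.64

# ASSUMPTION: prices are quoted per 1 million tokens (unit not stated in the source).
TOKENS_PER_PRICE_UNIT = 1_000_000


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price: float = INPUT_PRICE_USD,
    output_price: float = OUTPUT_PRICE_USD,
    tokens_per_unit: int = TOKENS_PER_PRICE_UNIT,
) -> float:
    """Estimated cost in USD for one request under the assumed pricing unit."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (input_tokens * input_price + output_tokens * output_price) / tokens_per_unit


if __name__ == "__main__":
    # Example: a 100K-token prompt with a 2K-token completion.
    print(f"${estimate_cost(100_000, 2_000):.4f}")
```

If the actual unit differs (for example, per 1K tokens), only `tokens_per_unit` needs to change.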

External sources

LLM Stats · Artificial Analysis