Muse Spark
Meta · Proprietary
Description
Muse Spark is the first model in the Muse family developed by Meta Superintelligence Labs. It is a natively multimodal reasoning model that supports tool use, visual chain of thought, and multi-agent orchestration, including a Contemplating mode that runs multiple agents reasoning in parallel. It reports competitive performance in multimodal perception, reasoning, health, and agentic tasks, with Contemplating mode scoring 58% on Humanity's Last Exam and 38% on FrontierScience Research.
Release date
2026-04-08
Parameters
—
Context length
—
Modalities
—
Capability radar
General: 49
Coding: 48
Reasoning: 88
Science (est.): 66
Agents: 80
Multimodal: 70
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 48 | 57.0 | LS |
| Code Ranking | 18 | 82.0 | AA |
| General Ranking | 14 | 88.0 | AA |
| Multimodal Ranking | 68 | 60.0 | LS |
| Reasoning | 87 | 50.0 | LS |
| Science | 9 | 90.0 | AA |
Benchmark scores (LLM Stats)
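| Category | Benchmark | Score | Note |
|---|---|---|---|
| Agents | GDPval-AA | 1444.00 / 3000 | Aut. |
| Agents | DeepSearchQA | 74.8% | Aut. |
| Agents | Terminal-Bench 2.0 | 59.0% | Aut. |
| Agents | SWE-Bench Pro | 52.4% | Aut. |
| Biology | GPQA | 89.5% | Aut. |
| Code | LiveCodeBench Pro | 0.80 / 3000 | Aut. |
| Code | SWE-Bench Verified | 77.4% | Aut. |
| Communication | Tau2 Telecom | 91.5% | Aut. |
| General | MMMU-Pro | 80.4% | Aut. |
| General | SimpleVQA | 0.71 / 100 | Aut. |
| Grounding | ScreenSpot Pro | 84.1% | Aut. |
| Healthcare | MedXpertQA | 78.4% | Aut. |
| Healthcare | HealthBench Hard | 42.8% | Aut. |
| Math | Humanity's Last Exam | 58.4% | Aut. |
| Multimodal | CharXiv-R | 86.4% | Aut. |
| Multimodal | ZEROBench | 0.33 / 100 | Aut. |
| Physics | IPhO 2025 | 82.6% | Aut. |
| Reasoning | ERQA | 64.7% | Aut. |
| Reasoning | ARC-AGI v2 | 42.5% | Aut. |
| Reasoning | FrontierScience Research | 38.3% | Aut. |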
AA evaluation indices
Intelligence Index: 52.1
Coding Index: 47.5
Tau2: 0.9
GPQA: 0.9
IFBench: 0.8
LCR: 0.7
SciCode: 0.5
Terminal-Bench Hard: 0.5
HLE: 0.4
LLM Stats category scores
Finance: 100
Legal: 100
Agents: 100
General: 100
Reasoning: 97
Biology: 90
Chemistry: 90
Communication: 90
Physics: 90
Tool Calling: 80
Frontend Development: 80
Grounding: 80
Vision: 70
Code: 70
Image To Text: 70
Multimodal: 70
Search: 70
Spatial Reasoning: 60
Healthcare: 60
Math: 60
Pricing
Input price: Free
Output price: Free
Blended price (3:1): Free
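The page does not define how the 3:1 blend is computed. A minimal sketch, assuming it is a weighted average with three parts input price to one part output price; the function name `blended_price` and its parameters are hypothetical and not taken from this page:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    Assumption: a "3:1" blend weights the input price three times as
    heavily as the output price. Both prices must share the same unit
    (for example, USD per million tokens).
    """
    total = input_weight + output_weight
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_price + output_weight * output_price) / total


# Muse Spark is listed as free for both input and output.
print(blended_price(0.0, 0.0))

# Hypothetical prices, for illustration only (not from this page).
print(blended_price(1.0, 5.0))
```

Under this assumption a model that is free on both sides blends to zero, which is consistent with the "Free" entry above.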
Speed
Tokens/sec: 0.0 tokens/s
First-token latency: 0.00s
Time to response: 0.00s
Available providers
(LS internal units) No provider data available