Muse Spark
Meta · Proprietary
Description
Muse Spark is the first model in the Muse family developed by Meta Superintelligence Labs. It is a natively multimodal reasoning model with support for tool use, visual chain of thought, and multi-agent orchestration. Its Contemplating mode orchestrates multiple agents that reason in parallel. The model shows competitive performance in multimodal perception, reasoning, health, and agentic tasks, with Contemplating mode achieving 58% on Humanity's Last Exam and 38% on FrontierScience Research.
Release date
2026-04-08
Parameters
—
Context length
—
Modalities
—
Capability radar
general: 49
coding: 48
reasoning: 88
science (estimated): 66
agents: 80
multimodal: 70
Science uses a reasoning-based proxy when specialized science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 48 | 57.0 | LS |
| Code Ranking | 18 | 82.0 | AA |
| General Ranking | 14 | 88.0 | AA |
| Multimodal Ranking | 68 | 60.0 | LS |
| Reasoning | 87 | 50.0 | LS |
| Science | 9 | 90.0 | AA |
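Benchmark scores (LLM Stats)
| Category | Benchmark | Score | Source |
|---|---|---|---|
| Agents | GDPval-AA | 1444.00 / 3000 | Self-reported |
| Agents | DeepSearchQA | 74.8% | Self-reported |
| Agents | Terminal-Bench 2.0 | 59.0% | Self-reported |
| Agents | SWE-Bench Pro | 52.4% | Self-reported |
| Biology | GPQA | 89.5% | Self-reported |
| Code | LiveCodeBench Pro | 0.80 / 3000 | Self-reported |
| Code | SWE-Bench Verified | 77.4% | Self-reported |
| Communication | Tau2 Telecom | 91.5% | Self-reported |
| General | MMMU-Pro | 80.4% | Self-reported |
| General | SimpleVQA | 0.71 / 100 | Self-reported |
| Grounding | ScreenSpot Pro | 84.1% | Self-reported |
| Healthcare | MedXpertQA | 78.4% | Self-reported |
| Healthcare | HealthBench Hard | 42.8% | Self-reported |
| Math | Humanity's Last Exam | 58.4% | Self-reported |
| Multimodal | CharXiv-R | 86.4% | Self-reported |
| Multimodal | ZEROBench | 0.33 / 100 | Self-reported |
| Physics | IPhO 2025 | 82.6% | Self-reported |
| Reasoning | ERQA | 64.7% | Self-reported |
| Reasoning | ARC-AGI v2 | 42.5% | Self-reported |
| Reasoning | FrontierScience Research | 38.3% | Self-reported |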
AA evaluation indices
Intelligence Index: 52.1
Coding Index: 47.5
Tau2: 0.9
GPQA: 0.9
IFBench: 0.8
LCR: 0.7
SciCode: 0.5
Terminal-Bench Hard: 0.5
HLE: 0.4
LLM Stats category scores
Finance: 100
Legal: 100
Agents: 100
General: 100
Reasoning: 97
Biology: 90
Chemistry: 90
Communication: 90
Physics: 90
Tool Calling: 80
Frontend Development: 80
Grounding: 80
Vision: 70
Code: 70
Image To Text: 70
Multimodal: 70
Search: 70
Spatial Reasoning: 60
Healthcare: 60
Math: 60
Pricing
Input price: Free
Output price: Free
Blended price (3:1): Free
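For reference, a blended price at a 3:1 ratio is commonly computed as a weighted average of the input and output prices. The sketch below assumes that 3:1 means three input tokens for every output token; the page does not state this, and the function name `blended_price` and its parameters are hypothetical, chosen only for illustration.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices (same unit for both).

    Assumption: the 3:1 ratio weights input tokens three times as
    heavily as output tokens. This is not confirmed by the source page.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# With both prices listed as free, the blended price is also zero.
free_blend = blended_price(0.0, 0.0)

# Hypothetical non-zero prices, purely to illustrate the arithmetic:
# (3 * 1.0 + 1 * 5.0) / 4
example_blend = blended_price(1.0, 5.0)
```

Under this assumption, any model whose input and output are both free necessarily has a free blended price, which matches the three values listed here.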
Speed
Tokens per second: 0.0 tokens/s
First-token latency: 0.00s
Time to first response: 0.00s
Available providers
(LS internal units) No provider data