Muse Spark

Meta · Proprietary

Description

Muse Spark is the first model in the Muse family developed by Meta Superintelligence Labs. It is a natively multimodal reasoning model that supports tool use, visual chain of thought, and multi-agent orchestration. Its Contemplating mode orchestrates multiple agents that reason in parallel. The model reports competitive performance on multimodal perception, reasoning, health, and agentic tasks; in Contemplating mode it scores 58% on Humanity's Last Exam and 38% on FrontierScience Research.

Release date

2026-04-08

Parameters

—

Context length

—

Modality

—

Capability radar

Axes: general, coding, reasoning, science (estimated), agents, multimodal

When no dedicated science benchmark is available, the Science score is estimated using a reasoning proxy.

Rankings

Domain	Rank	Score	Source
Agentic capability	67	54.0	LS
Coding	33	79.0	AA
Overall	19	81.0	AA
Multimodal	77	60.0	LS
Reasoning	92	50.0	LS
Science	15	84.0	AA

Benchmark scores (LLM Stats)

Category	Benchmark	Score	Source
Agents	GDPval-AA	1164.00 / 3000	Self-reported
Agents	DeepSearchQA	74.8%	Self-reported
Agents	Terminal-Bench 2.0	59.0%	Self-reported
Agents	SWE-Bench Pro	52.4%	Self-reported
Biology	GPQA	89.5%	Self-reported
Code	LiveCodeBench Pro	0.80 / 3000	Self-reported
Code	SWE-Bench Verified	77.4%	Self-reported
Communication	Tau2 Telecom	91.5%	Self-reported
General	MMMU-Pro	80.4%	Self-reported
General	SimpleVQA	0.71 / 100	Self-reported
Grounding	ScreenSpot Pro	84.1%	Self-reported
Healthcare	MedXpertQA	78.4%	Self-reported
Healthcare	HealthBench Hard	42.8%	Self-reported
Math	Humanity's Last Exam	58.4%	Self-reported
Multimodal	CharXiv-R	86.4%	Self-reported
Multimodal	ZEROBench	0.33 / 100	Self-reported
Physics	IPhO 2025	82.6%	Self-reported
Reasoning	ERQA	64.7%	Self-reported
Reasoning	ARC-AGI v2	42.5%	Self-reported
Reasoning	FrontierScience Research	38.3%	Self-reported

AA evaluation indices

Metric	Value
Coding Index	58.6
Intelligence Index	43.1
Tau2	0.9
GPQA	0.9
IFBench	0.8
LCR	0.7
Terminalbench V2 1	0.6
SciCode	0.5
Terminalbench Hard	0.5
HLE	0.4
Tau Banking	0.2

LLM Stats category scores

Category	Score
Legal	100
Finance	100
Agents	100
General	100

Categories listed without a score: Reasoning, Physics, Biology, Chemistry, Communication, Frontend Development, Grounding, Tool Calling, Multimodal, Image To Text, Code, Vision, Math, Spatial Reasoning, Healthcare

Pricing

Input price: Free

Output price: Free

Blended price (3:1): Free
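
The page does not spell out how the 3:1 blended price is computed. A minimal sketch, assuming it is a weighted average with three parts input price to one part output price, might look like the following. The function name `blended_price` and the non-zero example prices are hypothetical and are not taken from the source.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    Assumption: "3:1" means 3 parts input to 1 part output.
    Prices can be in any unit (for example USD per 1M tokens),
    as long as both use the same unit.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return (input_weight * input_price + output_weight * output_price) / total_weight


# The listing shows both input and output as free, so the blend is zero.
print(blended_price(0.0, 0.0))  # 0.0

# Hypothetical prices for illustration only (not from the source).
print(blended_price(1.0, 5.0))  # 2.0
```

Under this assumption, a model that is free for both input and output is also free on the blended figure, which matches the listing.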

Speed

Tokens per second: 0.0

Time to first token: 0.00s

Time to first response: 0.00s

Provider price ranking

No provider data available.

External links

LLM Stats · Artificial Analysis