Muse Spark

MetaProprietary

説明

Muse Spark is the first model in the Muse family developed by Meta Superintelligence Labs. It is a natively multimodal reasoning model with support for tool-use, visual chain of thought, and multi-agent orchestration. It features a Contemplating mode that orchestrates multiple agents reasoning in parallel. It demonstrates competitive performance in multimodal perception, reasoning, health, and agentic tasks, with Contemplating mode achieving 58% on Humanity's Last Exam and 38% on FrontierScience Research.

リリース日

2026-04-08

パラメータ

—

コンテキスト長

—

モダリティ

—

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Agents

GDPval-AA

1164.00 / 3000自己申告

DeepSearchQA

74.8%自己申告

Terminal-Bench 2.0

59.0%自己申告

SWE-Bench Pro

52.4%自己申告

Biology

GPQA

89.5%自己申告

Code

LiveCodeBench Pro

0.80 / 3000自己申告

SWE-Bench Verified

77.4%自己申告

Communication

Tau2 Telecom

91.5%自己申告

General

MMMU-Pro

80.4%自己申告

SimpleVQA

0.71 / 100自己申告

Grounding

ScreenSpot Pro

84.1%自己申告

Healthcare

MedXpertQA

78.4%自己申告

HealthBench Hard

42.8%自己申告

Math

Humanity's Last Exam

58.4%自己申告

Multimodal

CharXiv-R

86.4%自己申告

ZEROBench

0.33 / 100自己申告

Physics

IPhO 2025

82.6%自己申告

Reasoning

ERQA

64.7%自己申告

ARC-AGI v2

42.5%自己申告

FrontierScience Research

38.3%自己申告

AA評価指数

Coding Index

58.6

Intelligence Index

43.1

Tau2

0.9

Gpqa

0.9

Ifbench

0.8

Lcr

0.7

Terminalbench V2 1

0.6

Scicode

0.5

Terminalbench Hard

0.5

Hle

0.4

Tau Banking

0.2

LLM Statsカテゴリスコア

Legal

100

Finance

100

Agents

100

General

100

Reasoning

Physics

Biology

Chemistry

Communication

Frontend Development

Grounding

Tool Calling

Image To Text

Multimodal

Code

Vision

Math

Spatial Reasoning

Healthcare

価格設定

入力価格無料

出力価格無料

混合価格（3:1）無料

速度

トークン/秒0.0

初トークン遅延0.00s

初回答遅延0.00s

プロバイダー価格ランキング

プロバイダーデータがありません

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
エージェント能力	67	54.0	LS
コーディングランキング	33	79.0	AA
総合ランキング	19	81.0	AA
マルチモーダルランキング	77	60.0	LS
推論	92	50.0	LS
科学	15	84.0	AA