Claude Mythos Preview

Anthropic · Claude · Proprietary

Description

Claude Mythos Preview is an unreleased general-purpose frontier model from Anthropic that sits in a new tier above Opus (internal codename 'Capybara'). As part of Project Glasswing, a cross-industry cybersecurity initiative with 12 partners including AWS, Apple, Microsoft, and Google, it identified thousands of zero-day vulnerabilities across every major operating system and web browser. It reports state-of-the-art results on SWE-bench Verified (93.9%), GPQA Diamond (94.6%), USAMO (97.6%), Terminal-Bench 2.0 (82.0%), CyberGym (83.1%), and Cybench (100% pass@1, saturated). It represents a 4.3x increase over the previous trendline for model performance. The model is deployed under the ASL-3 Standard. Anthropic's risk report describes it as the best-aligned Claude model to date, and it underwent the first-ever 24-hour internal alignment review before deployment. It is not planned for general availability. Pricing for participants is $25 per million input tokens and $125 per million output tokens. The system card runs to 244 pages.

Release date

2026-05-07

Parameters

—

Context length

—

Modalities

image, text

Capability radar

[Radar chart; axes: general, coding, reasoning, science (estimated), agents, multimodal]

When specialized science benchmarks are unavailable, Science is estimated using a reasoning proxy.

Benchmark scores (LLM Stats)

Agents

CyBench

100.0% (self-reported)

BrowseComp

86.9% (self-reported)

CyberGym

83.1% (self-reported)

Terminal-Bench 2.0

82.0% (self-reported)

OSWorld-Verified

79.6% (self-reported)

SWE-Bench Pro

77.8% (self-reported)

SWE-Bench Multimodal

59.0% (self-reported)

Biology

GPQA

94.6% (self-reported)

Code

SWE-Bench Verified

93.9% (self-reported)

SWE-bench Multilingual

87.3% (self-reported)

General

MMMLU

92.7% (self-reported)

Healthcare

FigQA

89.0% (self-reported)

Long Context

Graphwalks BFS >128k

80.0% (self-reported)

Math

USAMO25

97.6% (self-reported)

Humanity's Last Exam

64.7% (self-reported)

Multimodal

CharXiv-R

93.2% (self-reported)

AA evaluation index

No AA evaluation data available

LLM Stats category scores

[Chart; categories: Language, Multimodal, Physics, Reasoning, Safety, Frontend Development, General, Healthcare, Biology, Chemistry, Long Context, Math, Spatial Reasoning, Agents, Code, Tool Calling, Vision]

Pricing

No pricing data available

Speed

No speed data available

Provider price ranking

No provider data available

External links

LLM Stats · Artificial Analysis

Domain	Rank	Score	Source
Agent capabilities	2	79.0	LS
Multimodal ranking	3	93.0	LS