Claude Mythos Preview

AnthropicClaudeProprietary

설명

Claude Mythos Preview is an unreleased general-purpose frontier model from Anthropic, a new tier above Opus (internal codename 'Capybara'). It identified thousands of zero-day vulnerabilities across every major operating system and web browser as part of Project Glasswing, a cross-industry cybersecurity initiative with 12 partners including AWS, Apple, Microsoft, and Google. State-of-the-art on SWE-bench Verified (93.9%), GPQA Diamond (94.6%), USAMO (97.6%), Terminal-Bench 2.0 (82.0%), CyberGym (83.1%), and Cybench (100% pass@1, saturated). Represents a 4.3x increase over the previous trendline for model performance. Deployed under ASL-3 Standard. Best-aligned Claude model to date per Anthropic's risk report, with the first-ever 24-hour internal alignment review before deployment. Not planned for general availability. Pricing for participants: $25/$125 per million tokens (input/output). 244-page system card.

출시일

2026-05-07

파라미터

—

컨텍스트 길이

—

모달리티

image, text

능력 레이더

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
에이전트형 역량	2	79.0	LS
멀티모달 랭킹	3	93.0	LS

벤치마크 점수 (LLM Stats)

Agents

CyBench

100.0%자체 보고

BrowseComp

86.9%자체 보고

CyberGym

83.1%자체 보고

Terminal-Bench 2.0

82.0%자체 보고

OSWorld-Verified

79.6%자체 보고

SWE-Bench Pro

77.8%자체 보고

SWE-Bench Multimodal

59.0%자체 보고

Biology

GPQA

94.6%자체 보고

Code

SWE-Bench Verified

93.9%자체 보고

SWE-bench Multilingual

87.3%자체 보고

General

MMMLU

92.7%자체 보고

Healthcare

FigQA

89.0%자체 보고

Long Context

Graphwalks BFS >128k

80.0%자체 보고

Math

USAMO25

97.6%자체 보고

Humanity's Last Exam

64.7%자체 보고

Multimodal

CharXiv-R

93.2%자체 보고

AA 평가 지수

AA 평가 데이터가 없습니다

LLM Stats 카테고리 점수

Language

Multimodal

Physics

Reasoning

Safety

Frontend Development

General

Healthcare

Biology

Chemistry

Long Context

Math

Spatial Reasoning

Agents

Code

Tool Calling

Vision

가격

가격 데이터가 없습니다

속도

속도 데이터가 없습니다

공급자 가격 순위

프로바이더 데이터가 없습니다

외부 링크

LLM Stats Artificial Analysis