Skip to main content

Seed 2.1 Pro

ByteDanceProprietary

Description

ByteDance's flagship next-generation agent model built for real-world productivity. A deep-thinking model with strong demand understanding, long-horizon planning, and continuous self-repair, it delivers reliable end-to-end results across complex coding, long-chain agents, and multi-step engineering workflows. Seed 2.1 Pro also advances knowledge, reasoning, and multimodal understanding, with SOTA results across several video understanding benchmarks. Served via Volcano Engine as Doubao-Seed-2.1-pro.

Release Date
2026-06-24
Parameters
Context Length
Modalities

Capability Radar

80
general
60
coding
70
reasoning
51
scienceest.
70
agents
70
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Agentic Capability38
60.0
LS
Multimodal Ranking70
70.0
LS
Reasoning79
56.0
LS

Benchmark Scores (LLM Stats)

3d

BLINK81.4%SR

Agents

GDPval87.9%SR
BrowseComp86.2%SR
MCP Atlas83.8%SR
OSWorld78.8%SR
Web Bench78.4%SR
MobileWorld73.1%SR
OfficeQA Pro72.2%SR
Terminal-Bench 2.171.0%SR
CyberGym70.2%SR
OneMillion Bench68.8%SR
Agent Startup Bench68.8%SR
SeedClawBench66.6%SR
Trae Error Fix63.3%SR
Trae Code Gen62.4%SR
WildClawBench61.7%SR
xDailyBench61.0%SR
Finance Agent v1.160.7%SR
SWE-Bench Pro57.5%SR
Repo Env55.0%SR
PresentBench54.6%SR
Workspace Bench53.0%SR
Doubao Multi-Turn Bench52.5%SR
ClawEval-MM51.0%SR
Toolathlon50.6%SR
Program Bench50.3%SR
NL2Repo47.0%SR
CreativeWork42.5%SR
Agents' Last Exam41.4%SR
SWE-Atlas35.2%SR
APEX-Agents33.8%SR
DeepSWE32.7%SR
GameWorld31.2%SR
PostTrainBench16.5%SR

Biology

SciCode59.8%SR

Chemistry

SuperGPQA70.8%SR
SuperChem59.8%SR

Code

Artifacts Bench51.0%SR
FrontierCS46.3%SR

Coding

AetherCode65.8%SR
Image2FloorPlan48.0%SR

Embodied

EmbSpatialBench0.83 / 100SR

General

MMMU-Pro82.7%SR
SimpleVQA0.74 / 100SR
MSQA50.2%SR
KINA48.3%SR

Image To Text

OCRBench_V263.2%SR

Knowledge

VideoSimpleQA76.4%SR
WorldBench67.6%SR

Long Context

DUDE82.8%SR
LongVideoBench80.6%SR
MMLongBench-128K78.3%SR
LVBench78.0%SR

Math

MathVision94.5%SR
MathVista90.7%SR
MathVerse89.7%SR
Beyond AIME87.0%SR
EMMA79.3%SR
FrontierScience Olympiad75.0%SR
DynaMath73.1%SR
IMO 20250.65 / 42SR
Humanity's Last Exam55.7%SR
IMOProof-Adv54.3%SR
MathArena Apex31.3%SR
LiveMathematicianBench20.9%SR
HorizonMath2.0%SR

Multimodal

CharXiv-D95.5%SR
Video-MME89.2%SR
CharXiv-R86.4%SR
VLMsAreBiased83.6%SR
OVOBench80.7%SR
TVBench80.5%SR
TOMATO79.5%SR
LiveSports-3K76.8%SR
MotionBench74.9%SR
BabyVision73.7%SR
TreeBench71.1%SR
ChartQAPro70.9%SR
Minerva70.7%SR
OVBench70.0%SR
VideoHolmes68.2%SR
CrossVid65.0%SR
ContPhy63.6%SR
MeasureBench62.9%SR
ZEROBench0.56 / 100SR
VisuLogic0.54 / 100SR
WorldVQA53.0%SR
VisFactor51.4%SR
MMSIBench35.9%SR

Physics

IPhO 202579.3%SR

Reasoning

ERQA72.0%SR
ArcAGI262.5%SR
FrontierScience Research28.3%SR

Spatial Reasoning

RealWorldQA86.7%SR

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Structured Output
100
Search
90
Legal
80
Long Context
80
Spatial Reasoning
80
Embodied
80
Finance
80
General
80
3d
80
Image To Text
70
Math
70
Multimodal
70
Physics
70
Reasoning
70
Safety
70
Healthcare
70
Chemistry
70
Economics
70
Tool Calling
70
Video
70
Vision
70
Agents
60
Biology
60
Code
60
Frontend Development
50
Coding
50
Science
30
Systems
20

Pricing

No pricing data available

Speed

No speed data available

Provider Price Ranking

No provider data available

External Sources