Seed 2.1 Pro
ByteDanceProprietary
Description
ByteDance's flagship next-generation agent model built for real-world productivity. A deep-thinking model with strong demand understanding, long-horizon planning, and continuous self-repair, it delivers reliable end-to-end results across complex coding, long-chain agents, and multi-step engineering workflows. Seed 2.1 Pro also advances knowledge, reasoning, and multimodal understanding, with SOTA results across several video understanding benchmarks. Served via Volcano Engine as Doubao-Seed-2.1-pro.
Release Date
2026-06-24
Parameters
—
Context Length
—
Modalities
—
Capability Radar
80
general
60
coding
70
reasoning
51
scienceest.
70
agents
70
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Agentic Capability | 38 | 60.0 | LS |
| Multimodal Ranking | 70 | 70.0 | LS |
| Reasoning | 79 | 56.0 | LS |
Benchmark Scores (LLM Stats)
3d
BLINK
81.4%SR
Agents
GDPval
87.9%SR
BrowseComp
86.2%SR
MCP Atlas
83.8%SR
OSWorld
78.8%SR
Web Bench
78.4%SR
MobileWorld
73.1%SR
OfficeQA Pro
72.2%SR
Terminal-Bench 2.1
71.0%SR
CyberGym
70.2%SR
OneMillion Bench
68.8%SR
Agent Startup Bench
68.8%SR
SeedClawBench
66.6%SR
Trae Error Fix
63.3%SR
Trae Code Gen
62.4%SR
WildClawBench
61.7%SR
xDailyBench
61.0%SR
Finance Agent v1.1
60.7%SR
SWE-Bench Pro
57.5%SR
Repo Env
55.0%SR
PresentBench
54.6%SR
Workspace Bench
53.0%SR
Doubao Multi-Turn Bench
52.5%SR
ClawEval-MM
51.0%SR
Toolathlon
50.6%SR
Program Bench
50.3%SR
NL2Repo
47.0%SR
CreativeWork
42.5%SR
Agents' Last Exam
41.4%SR
SWE-Atlas
35.2%SR
APEX-Agents
33.8%SR
DeepSWE
32.7%SR
GameWorld
31.2%SR
PostTrainBench
16.5%SR
Biology
SciCode
59.8%SR
Chemistry
SuperGPQA
70.8%SR
SuperChem
59.8%SR
Code
Artifacts Bench
51.0%SR
FrontierCS
46.3%SR
Coding
AetherCode
65.8%SR
Image2FloorPlan
48.0%SR
Embodied
EmbSpatialBench
0.83 / 100SR
General
MMMU-Pro
82.7%SR
SimpleVQA
0.74 / 100SR
MSQA
50.2%SR
KINA
48.3%SR
Image To Text
OCRBench_V2
63.2%SR
Knowledge
VideoSimpleQA
76.4%SR
WorldBench
67.6%SR
Long Context
DUDE
82.8%SR
LongVideoBench
80.6%SR
MMLongBench-128K
78.3%SR
LVBench
78.0%SR
Math
MathVision
94.5%SR
MathVista
90.7%SR
MathVerse
89.7%SR
Beyond AIME
87.0%SR
EMMA
79.3%SR
FrontierScience Olympiad
75.0%SR
DynaMath
73.1%SR
IMO 2025
0.65 / 42SR
Humanity's Last Exam
55.7%SR
IMOProof-Adv
54.3%SR
MathArena Apex
31.3%SR
LiveMathematicianBench
20.9%SR
HorizonMath
2.0%SR
Multimodal
CharXiv-D
95.5%SR
Video-MME
89.2%SR
CharXiv-R
86.4%SR
VLMsAreBiased
83.6%SR
OVOBench
80.7%SR
TVBench
80.5%SR
TOMATO
79.5%SR
LiveSports-3K
76.8%SR
MotionBench
74.9%SR
BabyVision
73.7%SR
TreeBench
71.1%SR
ChartQAPro
70.9%SR
Minerva
70.7%SR
OVBench
70.0%SR
VideoHolmes
68.2%SR
CrossVid
65.0%SR
ContPhy
63.6%SR
MeasureBench
62.9%SR
ZEROBench
0.56 / 100SR
VisuLogic
0.54 / 100SR
WorldVQA
53.0%SR
VisFactor
51.4%SR
MMSIBench
35.9%SR
Physics
IPhO 2025
79.3%SR
Reasoning
ERQA
72.0%SR
ArcAGI2
62.5%SR
FrontierScience Research
28.3%SR
Spatial Reasoning
RealWorldQA
86.7%SR
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Structured Output100
Search90
Legal80
Long Context80
Spatial Reasoning80
Embodied80
Finance80
General80
3d80
Image To Text70
Math70
Multimodal70
Physics70
Reasoning70
Safety70
Healthcare70
Chemistry70
Economics70
Tool Calling70
Video70
Vision70
Agents60
Biology60
Code60
Frontend Development50
Coding50
Science30
Systems20
Pricing
No pricing data available
Speed
No speed data available
Provider Price Ranking
No provider data available