跳转到主要内容

Seed 2.1 Turbo

ByteDanceProprietary

描述

ByteDance's low-cost, low-latency next-generation agent model for large-scale production. A deep-thinking model with the full feature set of Seed 2.1 Pro and comparable performance, it is designed for enterprise-grade deployments handling high volumes of online calls across coding, long-chain agents, and multimodal tasks. Served via Volcano Engine as Doubao-Seed-2.1-turbo.

发布日期
2026-06-24
参数规模
上下文长度
支持模态

能力雷达图

80
general
50
coding
70
reasoning
51
science估算
70
agents
70
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
智能体能力模型榜51
57.0
LS
多模态榜76
66.0
LS
推理能力73
57.0
LS

基准测试分数 (LLM Stats)

3d

BLINK79.4%自报

Agents

BrowseComp84.9%自报
GDPval82.7%自报
MCP Atlas80.3%自报
OSWorld76.4%自报
Web Bench73.6%自报
OfficeQA Pro71.1%自报
MobileWorld70.0%自报
Terminal-Bench 2.167.6%自报
CyberGym67.0%自报
OneMillion Bench66.6%自报
SeedClawBench63.8%自报
WildClawBench62.8%自报
Trae Code Gen59.7%自报
SWE-Bench Pro57.0%自报
Trae Error Fix56.7%自报
xDailyBench56.4%自报
Finance Agent v1.156.0%自报
Workspace Bench54.7%自报
Agent Startup Bench54.0%自报
Program Bench49.4%自报
Toolathlon49.1%自报
Doubao Multi-Turn Bench49.0%自报
PresentBench48.3%自报
Repo Env46.7%自报
ClawEval-MM46.0%自报
NL2Repo43.7%自报
CreativeWork34.5%自报
SWE-Atlas30.6%自报
APEX-Agents29.2%自报
GameWorld25.9%自报
DeepSWE23.0%自报
PostTrainBench18.3%自报

Biology

SciCode57.8%自报

Chemistry

SuperGPQA67.4%自报
SuperChem56.6%自报

Code

FrontierCS50.8%自报
Artifacts Bench47.0%自报

Coding

AetherCode67.9%自报
Image2FloorPlan35.9%自报

Embodied

EmbSpatialBench0.82 / 100自报

General

MMMU-Pro82.2%自报
SimpleVQA0.71 / 100自报
KINA46.6%自报
MSQA42.0%自报

Image To Text

OCRBench_V262.8%自报

Knowledge

VideoSimpleQA71.4%自报
WorldBench63.7%自报

Long Context

DUDE83.1%自报
LongVideoBench80.6%自报
MMLongBench-128K76.9%自报
LVBench76.8%自报

Math

MathVision92.7%自报
MathVista90.5%自报
MathVerse89.2%自报
Beyond AIME88.0%自报
EMMA78.4%自报
FrontierScience Olympiad76.0%自报
DynaMath68.1%自报
Humanity's Last Exam54.6%自报
MathArena Apex35.4%自报
LiveMathematicianBench27.7%自报
HorizonMath2.0%自报

Multimodal

CharXiv-D94.6%自报
Video-MME89.0%自报
CharXiv-R83.6%自报
OVOBench79.2%自报
TVBench77.2%自报
LiveSports-3K77.1%自报
MotionBench74.8%自报
TreeBench71.1%自报
ChartQAPro70.9%自报
OVBench69.7%自报
VLMsAreBiased68.3%自报
VideoHolmes67.6%自报
Minerva65.9%自报
CrossVid63.2%自报
BabyVision62.9%自报
ContPhy61.1%自报
MeasureBench58.9%自报
ZEROBench0.57 / 100自报
TOMATO56.8%自报
VisuLogic0.53 / 100自报
WorldVQA48.6%自报
VisFactor43.9%自报
MMSIBench31.4%自报

Reasoning

ERQA71.3%自报
ArcAGI261.3%自报
FrontierScience Research33.3%自报

Spatial Reasoning

RealWorldQA86.3%自报

AA 评测指数

暂无 AA 评测数据

LLM Stats 分类评分

Structured Output
90
Legal
80
Long Context
80
Search
80
Spatial Reasoning
80
Embodied
80
Finance
80
General
80
3d
80
Image To Text
70
Math
70
Multimodal
70
Safety
70
Healthcare
70
Economics
70
Tool Calling
70
Video
70
Vision
70
Physics
60
Reasoning
60
Agents
60
Biology
60
Chemistry
60
Frontend Development
50
Code
50
Coding
50
Science
30
Systems
20

定价

暂无定价数据

速度

暂无速度数据

供应商价格排行

暂无提供商数据

外部链接