跳轉到主要內容

Seed 2.1 Turbo

ByteDanceProprietary

描述

ByteDance's low-cost, low-latency next-generation agent model for large-scale production. A deep-thinking model with the full feature set of Seed 2.1 Pro and comparable performance, it is designed for enterprise-grade deployments handling high volumes of online calls across coding, long-chain agents, and multimodal tasks. Served via Volcano Engine as Doubao-Seed-2.1-turbo.

發布日期
2026-06-24
參數規模
上下文長度
支援模態

能力雷達圖

80
general
50
coding
70
reasoning
51
science估算
70
agents
70
multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域#排名分數來源
智慧體能力模型榜51
57.0
LS
多模態榜76
66.0
LS
推理能力73
57.0
LS

基準測試分數 (LLM Stats)

3d

BLINK79.4%自報

Agents

BrowseComp84.9%自報
GDPval82.7%自報
MCP Atlas80.3%自報
OSWorld76.4%自報
Web Bench73.6%自報
OfficeQA Pro71.1%自報
MobileWorld70.0%自報
Terminal-Bench 2.167.6%自報
CyberGym67.0%自報
OneMillion Bench66.6%自報
SeedClawBench63.8%自報
WildClawBench62.8%自報
Trae Code Gen59.7%自報
SWE-Bench Pro57.0%自報
Trae Error Fix56.7%自報
xDailyBench56.4%自報
Finance Agent v1.156.0%自報
Workspace Bench54.7%自報
Agent Startup Bench54.0%自報
Program Bench49.4%自報
Toolathlon49.1%自報
Doubao Multi-Turn Bench49.0%自報
PresentBench48.3%自報
Repo Env46.7%自報
ClawEval-MM46.0%自報
NL2Repo43.7%自報
CreativeWork34.5%自報
SWE-Atlas30.6%自報
APEX-Agents29.2%自報
GameWorld25.9%自報
DeepSWE23.0%自報
PostTrainBench18.3%自報

Biology

SciCode57.8%自報

Chemistry

SuperGPQA67.4%自報
SuperChem56.6%自報

Code

FrontierCS50.8%自報
Artifacts Bench47.0%自報

Coding

AetherCode67.9%自報
Image2FloorPlan35.9%自報

Embodied

EmbSpatialBench0.82 / 100自報

General

MMMU-Pro82.2%自報
SimpleVQA0.71 / 100自報
KINA46.6%自報
MSQA42.0%自報

Image To Text

OCRBench_V262.8%自報

Knowledge

VideoSimpleQA71.4%自報
WorldBench63.7%自報

Long Context

DUDE83.1%自報
LongVideoBench80.6%自報
MMLongBench-128K76.9%自報
LVBench76.8%自報

Math

MathVision92.7%自報
MathVista90.5%自報
MathVerse89.2%自報
Beyond AIME88.0%自報
EMMA78.4%自報
FrontierScience Olympiad76.0%自報
DynaMath68.1%自報
Humanity's Last Exam54.6%自報
MathArena Apex35.4%自報
LiveMathematicianBench27.7%自報
HorizonMath2.0%自報

Multimodal

CharXiv-D94.6%自報
Video-MME89.0%自報
CharXiv-R83.6%自報
OVOBench79.2%自報
TVBench77.2%自報
LiveSports-3K77.1%自報
MotionBench74.8%自報
TreeBench71.1%自報
ChartQAPro70.9%自報
OVBench69.7%自報
VLMsAreBiased68.3%自報
VideoHolmes67.6%自報
Minerva65.9%自報
CrossVid63.2%自報
BabyVision62.9%自報
ContPhy61.1%自報
MeasureBench58.9%自報
ZEROBench0.57 / 100自報
TOMATO56.8%自報
VisuLogic0.53 / 100自報
WorldVQA48.6%自報
VisFactor43.9%自報
MMSIBench31.4%自報

Reasoning

ERQA71.3%自報
ArcAGI261.3%自報
FrontierScience Research33.3%自報

Spatial Reasoning

RealWorldQA86.3%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Structured Output
90
Legal
80
Long Context
80
Search
80
Spatial Reasoning
80
Embodied
80
Finance
80
General
80
3d
80
Image To Text
70
Math
70
Multimodal
70
Safety
70
Healthcare
70
Economics
70
Tool Calling
70
Video
70
Vision
70
Physics
60
Reasoning
60
Agents
60
Biology
60
Chemistry
60
Frontend Development
50
Code
50
Coding
50
Science
30
Systems
20

定價

暫無定價資料

速度

暫無速度資料

供應商價格排行

暫無提供商資料

外部連結