Seed 2.1 Pro

ByteDanceProprietary

설명

ByteDance's flagship next-generation agent model built for real-world productivity. A deep-thinking model with strong demand understanding, long-horizon planning, and continuous self-repair, it delivers reliable end-to-end results across complex coding, long-chain agents, and multi-step engineering workflows. Seed 2.1 Pro also advances knowledge, reasoning, and multimodal understanding, with SOTA results across several video understanding benchmarks. Served via Volcano Engine as Doubao-Seed-2.1-pro.

출시일

2026-06-24

파라미터

—

컨텍스트 길이

—

모달리티

—

능력 레이더

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
에이전트형 역량	38	60.0	LS
멀티모달 랭킹	70	70.0	LS
추론	79	56.0	LS

벤치마크 점수 (LLM Stats)

3d

BLINK

81.4%자체 보고

Agents

GDPval

87.9%자체 보고

BrowseComp

86.2%자체 보고

MCP Atlas

83.8%자체 보고

OSWorld

78.8%자체 보고

Web Bench

78.4%자체 보고

MobileWorld

73.1%자체 보고

OfficeQA Pro

72.2%자체 보고

Terminal-Bench 2.1

71.0%자체 보고

CyberGym

70.2%자체 보고

OneMillion Bench

68.8%자체 보고

Agent Startup Bench

68.8%자체 보고

SeedClawBench

66.6%자체 보고

Trae Error Fix

63.3%자체 보고

Trae Code Gen

62.4%자체 보고

WildClawBench

61.7%자체 보고

xDailyBench

61.0%자체 보고

Finance Agent v1.1

60.7%자체 보고

SWE-Bench Pro

57.5%자체 보고

Repo Env

55.0%자체 보고

PresentBench

54.6%자체 보고

Workspace Bench

53.0%자체 보고

Doubao Multi-Turn Bench

52.5%자체 보고

ClawEval-MM

51.0%자체 보고

Toolathlon

50.6%자체 보고

Program Bench

50.3%자체 보고

NL2Repo

47.0%자체 보고

CreativeWork

42.5%자체 보고

Agents' Last Exam

41.4%자체 보고

SWE-Atlas

35.2%자체 보고

APEX-Agents

33.8%자체 보고

DeepSWE

32.7%자체 보고

GameWorld

31.2%자체 보고

PostTrainBench

16.5%자체 보고

Biology

SciCode

59.8%자체 보고

Chemistry

SuperGPQA

70.8%자체 보고

SuperChem

59.8%자체 보고

Code

Artifacts Bench

51.0%자체 보고

FrontierCS

46.3%자체 보고

Coding

AetherCode

65.8%자체 보고

Image2FloorPlan

48.0%자체 보고

Embodied

EmbSpatialBench

0.83 / 100자체 보고

General

MMMU-Pro

82.7%자체 보고

SimpleVQA

0.74 / 100자체 보고

MSQA

50.2%자체 보고

KINA

48.3%자체 보고

Image To Text

OCRBench_V2

63.2%자체 보고

Knowledge

VideoSimpleQA

76.4%자체 보고

WorldBench

67.6%자체 보고

Long Context

DUDE

82.8%자체 보고

LongVideoBench

80.6%자체 보고

MMLongBench-128K

78.3%자체 보고

LVBench

78.0%자체 보고

Math

MathVision

94.5%자체 보고

MathVista

90.7%자체 보고

MathVerse

89.7%자체 보고

Beyond AIME

87.0%자체 보고

EMMA

79.3%자체 보고

FrontierScience Olympiad

75.0%자체 보고

DynaMath

73.1%자체 보고

IMO 2025

0.65 / 42자체 보고

Humanity's Last Exam

55.7%자체 보고

IMOProof-Adv

54.3%자체 보고

MathArena Apex

31.3%자체 보고

LiveMathematicianBench

20.9%자체 보고

HorizonMath

2.0%자체 보고

Multimodal

CharXiv-D

95.5%자체 보고

Video-MME

89.2%자체 보고

CharXiv-R

86.4%자체 보고

VLMsAreBiased

83.6%자체 보고

OVOBench

80.7%자체 보고

TVBench

80.5%자체 보고

TOMATO

79.5%자체 보고

LiveSports-3K

76.8%자체 보고

MotionBench

74.9%자체 보고

BabyVision

73.7%자체 보고

TreeBench

71.1%자체 보고

ChartQAPro

70.9%자체 보고

Minerva

70.7%자체 보고

OVBench

70.0%자체 보고

VideoHolmes

68.2%자체 보고

CrossVid

65.0%자체 보고

ContPhy

63.6%자체 보고

MeasureBench

62.9%자체 보고

ZEROBench

0.56 / 100자체 보고

VisuLogic

0.54 / 100자체 보고

WorldVQA

53.0%자체 보고

VisFactor

51.4%자체 보고

MMSIBench

35.9%자체 보고

Physics

IPhO 2025

79.3%자체 보고

Reasoning

ERQA

72.0%자체 보고

ArcAGI2

62.5%자체 보고

FrontierScience Research

28.3%자체 보고

Spatial Reasoning

RealWorldQA

86.7%자체 보고

AA 평가 지수

AA 평가 데이터가 없습니다

LLM Stats 카테고리 점수

Structured Output

100

Legal

Long Context

Spatial Reasoning

Embodied

Finance

General

Image To Text

Math

Multimodal

Physics

Reasoning

Safety

Healthcare

Chemistry

Economics

Tool Calling

Video

Vision

Agents

Biology

Code

Frontend Development

Coding

Science

Systems

가격

가격 데이터가 없습니다

속도

속도 데이터가 없습니다

공급자 가격 순위

프로바이더 데이터가 없습니다

외부 링크

LLM Stats Artificial Analysis