Step3 VL 10B
StepFun · Open Weight · Apache 2.0 · Commercial OK
Description
STEP3-VL-10B is a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence. It is built with a unified, fully unfrozen pre-training strategy on 1.2T multimodal tokens, integrating a language-aligned Perception Encoder with a Qwen3-8B decoder. It features Parallel Coordinated Reasoning (PaCoRe) to scale test-time compute for complex perceptual reasoning.
Release date: 2026-01-20
Parameters: 10.0B
Context length: not listed
Supported modalities: not listed
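Capability radar chart
general: 14
coding: 17
reasoning: 69
science (estimated): 44
agents: 0
multimodal: 85
Note: when no dedicated science evaluation is available, the science score is estimated using reasoning ability as a proxy.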
Leaderboard ranking
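Benchmark scores (LLM Stats)
Communication
  Multi-Challenge: 62.6% (self-reported)
General
  MMMU: 78.1% (self-reported)
Math
  AIME 2025: 87.7% (self-reported)
  MathVista: 84.0% (self-reported)
  MathVision: 70.8% (self-reported)
Multimodal
  MMBench: 91.8% (self-reported)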
AA evaluation indices
Intelligence Index: 15.4
Coding Index: 13.9
GPQA: 0.7
IFBench: 0.5
SciCode: 0.3
Tau2: 0.2
HLE: 0.1
TerminalBench Hard: 0.1
LCR: 0.0
LLM Stats category scores
Vision: 80
General: 80
Healthcare: 80
Math: 80
Multimodal: 80
Reasoning: 80
Communication: 60
Pricing
Input price: Free
Output price: Free
Blended price (3:1): Free
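The 3:1 blended price is commonly read as a weighted average of three parts input price to one part output price. This listing does not define the formula, so the following is a minimal sketch under that assumption; the function name `blended_price` and the non-zero sample prices are hypothetical and do not come from this page.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    The default 3:1 input-to-output weighting is an assumption about
    what "blended price (3:1)" means; the listing does not spell it out.
    """
    total = input_weight + output_weight
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_price + output_weight * output_price) / total


# Free input and free output, as listed for this model.
print(blended_price(0.0, 0.0))

# Hypothetical prices for illustration only (not from this listing).
print(blended_price(1.0, 3.0))
```

Under this reading, a model that is free for both input and output is also free on the blended measure, which matches the listing.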
Speed
Tokens per second: 0.0 tokens/s
Time to first token: 0.00s
Time to first answer: 0.00s
Available providers
(LS internal pricing unit) No provider data available