Step3 VL 10B
StepFunOpen WeightApache 2.0 · Commercial OK
描述
STEP3-VL-10B is a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence. Built on a unified, fully unfrozen pre-training strategy on 1.2T multimodal tokens integrating a language-aligned Perception Encoder with a Qwen3-8B decoder. Features Parallel Coordinated Reasoning (PaCoRe) to scale test-time compute for complex perceptual reasoning.
發布日期
2026-01-20
參數規模
10.0B
上下文長度
—
支援模態
—
能力雷達圖
14
general
17
coding
69
reasoning
44
science估算
0
agents
85
multimodal
Science 在缺少專門科學評測時使用推理能力代理估算。
排行榜排名
基準測試分數 (LLM Stats)
Communication
Multi-Challenge
62.6%自報
General
MMMU
78.1%自報
Math
AIME 2025
87.7%自報
MathVista
84.0%自報
MathVision
70.8%自報
Multimodal
MMBench
91.8%自報
AA 評測指數
Intelligence Index15.4
Coding Index13.9
Gpqa0.7
Ifbench0.5
Scicode0.3
Tau20.2
Hle0.1
Terminalbench Hard0.1
Lcr0.0
LLM Stats 分類評分
Vision80
General80
Healthcare80
Math80
Multimodal80
Reasoning80
Communication60
定價
輸入價格免費
輸出價格免費
混合價格(3:1)免費
速度
Tokens/秒0.0 tokens/s
首Token延遲0.00s
首回答延遲0.00s
可用提供商
(LS 內部計價單位)暫無提供商資料