Nova 2 Omni
Amazon · Proprietary
Description
Amazon Nova 2 Omni is Amazon's first unified multimodal reasoning model that processes text, documents, images, video, and audio inputs and generates both text and images from a single model, eliminating multi-model coordination complexity. It delivers strong multimodal perception, core reasoning, agentic tool use, and high-quality image generation and editing, with configurable extended thinking. It supports a 1M token context window, 200+ languages for text, and 10 languages for speech input.
Release date
2025-12-02
Parameter count
—
Context length
—
Supported modalities
—
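Capability radar chart
general: 70
coding: 0
reasoning: 90
science (estimated): 68
agents: 70
multimodal: 80
The science score is estimated using reasoning ability as a proxy when no dedicated science evaluation is available.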
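Leaderboard ranking
Benchmark scores (LLM Stats)
Agents
BFCL-V4: 58.3% (self-reported)
Audio
MMAU: 75.3% (self-reported)
MAVERIX: 66.6% (self-reported)
CoVoST2: 40.7% (self-reported)
Communication
Tau2 Telecom: 80.0% (self-reported)
Tau2 Retail: 78.3% (self-reported)
Multi-Challenge: 75.5% (self-reported)
Tau2 Airline: 68.8% (self-reported)
Document Understanding
RealKIE-FCC: 59.8% (self-reported)
Finance
MMLU-Pro: 80.7% (self-reported)
General
IFBench: 68.7% (self-reported)
MMMU-Pro: 61.4% (self-reported)
Grounding
RefCOCOg: 86.3% (self-reported)
ScreenSpot: 85.4% (self-reported)
Image To Text
OCRBench_V2: 58.2% (self-reported)
Math
AIME 2025: 92.1% (self-reported)
Multimodal
Video-MME: 77.9% (self-reported)
QVHighlights: 76.7% (self-reported)
AA evaluation index
No AA evaluation data available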
LLM Stats category scores
Spatial Reasoning: 90
Grounding: 90
Math: 90
Video: 80
Finance: 80
Healthcare: 80
Legal: 80
Reasoning: 80
Communication: 80
Tool Calling: 70
Vision: 70
General: 70
Instruction Following: 70
Multimodal: 70
Document Understanding: 60
Image To Text: 60
Language: 60
Agents: 60
Speech To Text: 40
Audio: 40
Pricing
No pricing data available
Speed
No speed data available
Provider price ranking
No provider data available