Nova 2 Omni
AmazonAmazonProprietary
描述
Amazon Nova 2 Omni is Amazon's first unified multimodal reasoning model that processes text, documents, images, video, and audio inputs and generates both text and images from a single model, eliminating multi-model coordination complexity. It delivers strong multimodal perception, core reasoning, agentic tool use, and high-quality image generation and editing, with configurable extended thinking. It supports a 1M token context window, 200+ languages for text, and 10 languages for speech input.
發布日期
2025-12-02
參數規模
—
上下文長度
—
支援模態
—
能力雷達圖
70
general
0
coding
90
reasoning
68
science估算
70
agents
80
multimodal
Science 在缺少專門科學評測時使用推理能力代理估算。
排行榜排名
基準測試分數 (LLM Stats)
Agents
BFCL-V4
58.3%自報
Audio
MMAU
75.3%自報
MAVERIX
66.6%自報
CoVoST2
40.7%自報
Communication
Tau2 Telecom
80.0%自報
Tau2 Retail
78.3%自報
Multi-Challenge
75.5%自報
Tau2 Airline
68.8%自報
Document Understanding
RealKIE-FCC
59.8%自報
Finance
MMLU-Pro
80.7%自報
General
IFBench
68.7%自報
MMMU-Pro
61.4%自報
Grounding
RefCOCOg
86.3%自報
ScreenSpot
85.4%自報
Image To Text
OCRBench_V2
58.2%自報
Math
AIME 2025
92.1%自報
Multimodal
Video-MME
77.9%自報
QVHighlights
76.7%自報
AA 評測指數
暫無 AA 評測資料
LLM Stats 分類評分
Spatial Reasoning90
Grounding90
Math90
Video80
Finance80
Healthcare80
Legal80
Reasoning80
Communication80
Tool Calling70
Vision70
General70
Instruction Following70
Multimodal70
Document Understanding60
Image To Text60
Language60
Agents60
Speech To Text40
Audio40
定價
暫無定價資料
速度
暫無速度資料
供應商價格排行
暫無提供商資料