Nova 2 Omni

AmazonAmazonProprietary

描述

Amazon Nova 2 Omni is Amazon's first unified multimodal reasoning model that processes text, documents, images, video, and audio inputs and generates both text and images from a single model, eliminating multi-model coordination complexity. It delivers strong multimodal perception, core reasoning, agentic tool use, and high-quality image generation and editing, with configurable extended thinking. It supports a 1M token context window, 200+ languages for text, and 10 languages for speech input.

發布日期

2025-12-02

參數規模

—

上下文長度

—

支援模態

—

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
智慧體能力模型榜	50	58.0	LS
多模態榜	61	73.0	LS

基準測試分數 (LLM Stats)

Agents

BFCL-V4

58.3%自報

Audio

MMAU

75.3%自報

MAVERIX

66.6%自報

CoVoST2

40.7%自報

Communication

Tau2 Telecom

80.0%自報

Tau2 Retail

78.3%自報

Multi-Challenge

75.5%自報

Tau2 Airline

68.8%自報

Document Understanding

RealKIE-FCC

59.8%自報

Finance

MMLU-Pro

80.7%自報

General

IFBench

68.7%自報

MMMU-Pro

61.4%自報

Grounding

RefCOCOg

86.3%自報

ScreenSpot

85.4%自報

Image To Text

OCRBench_V2

58.2%自報

Math

AIME 2025

92.1%自報

Multimodal

Video-MME

77.9%自報

QVHighlights

76.7%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Math

Spatial Reasoning

Grounding

Reasoning

Legal

Finance

Healthcare

Communication

Video

Multimodal

Instruction Following

General

Tool Calling

Vision

Image To Text

Language

Document Understanding

Agents

Speech To Text

Audio

定價

暫無定價資料

速度

暫無速度資料

供應商價格排行

暫無提供商資料

外部連結

LLM Stats Artificial Analysis