Nova 2 Omni

AmazonAmazonProprietary

描述

Amazon Nova 2 Omni is Amazon's first unified multimodal reasoning model that processes text, documents, images, video, and audio inputs and generates both text and images from a single model, eliminating multi-model coordination complexity. It delivers strong multimodal perception, core reasoning, agentic tool use, and high-quality image generation and editing, with configurable extended thinking. It supports a 1M token context window, 200+ languages for text, and 10 languages for speech input.

发布日期

2025-12-02

参数规模

—

上下文长度

—

支持模态

—

能力雷达图

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域	#排名	分数	来源
智能体能力模型榜	50	58.0	LS
多模态榜	61	73.0	LS

基准测试分数 (LLM Stats)

Agents

BFCL-V4

58.3%自报

Audio

MMAU

75.3%自报

MAVERIX

66.6%自报

CoVoST2

40.7%自报

Communication

Tau2 Telecom

80.0%自报

Tau2 Retail

78.3%自报

Multi-Challenge

75.5%自报

Tau2 Airline

68.8%自报

Document Understanding

RealKIE-FCC

59.8%自报

Finance

MMLU-Pro

80.7%自报

General

IFBench

68.7%自报

MMMU-Pro

61.4%自报

Grounding

RefCOCOg

86.3%自报

ScreenSpot

85.4%自报

Image To Text

OCRBench_V2

58.2%自报

Math

AIME 2025

92.1%自报

Multimodal

Video-MME

77.9%自报

QVHighlights

76.7%自报

AA 评测指数

暂无 AA 评测数据

LLM Stats 分类评分

Math

Spatial Reasoning

Grounding

Reasoning

Legal

Finance

Healthcare

Communication

Video

Multimodal

Instruction Following

General

Tool Calling

Vision

Image To Text

Language

Document Understanding

Agents

Speech To Text

Audio

定价

暂无定价数据

速度

暂无速度数据

供应商价格排行

暂无提供商数据

外部链接

LLM Stats Artificial Analysis