Nemotron 3 Ultra (550B A55B)

NVIDIA開源權重OpenMDW License v1.1 · 商用許可

描述

Nemotron 3 Ultra is NVIDIA's frontier-scale open model with 550B total / 55B active parameters, built for agentic reasoning, long-context analysis, tool use, and high-stakes RAG. It uses a hybrid Latent Mixture-of-Experts (LatentMoE) architecture interleaving Mamba-2, MoE, and select Attention layers, with Multi-Token Prediction (MTP) for native speculative decoding, and is pre-trained on ~20T tokens with an NVFP4 recipe. Reasoning is configurable on/off (plus a medium-effort mode) via the chat template. It supports up to a 1M-token context and 10 languages (English, French, Spanish, Italian, German, Japanese, Hindi, Korean, Brazilian Portuguese, Chinese). Released with open weights, training data, and recipes under the OpenMDW-1.1 license.

發布日期

2026-06-04

參數規模

550.0B

上下文長度

1.0M

支援模態

text

能力雷達圖

100

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
智慧體能力模型榜	93	48.0	LS
推理能力	21	85.0	LS

基準測試分數 (LLM Stats)

Agents

GDPval-AA

1183.00 / 3000自報

PinchBench

90.0%自報

Terminal-Bench 2.1

56.4%自報

ProfBench

56.0%自報

Finance Agent

53.7%自報

GDPval

46.7%自報

BrowseComp

44.4%自報

Finance Agent v2

37.5%自報

TAU3-Bench

22.6%自報

Biology

GPQA

87.0%自報

SciCode

44.6%自報

Code

SWE-Bench Verified

70.7%自報

SWE-bench Multilingual

67.7%自報

Communication

Multi-Challenge

63.8%自報

Finance

MMLU-Pro

86.8%自報

MMLU-ProX

83.0%自報

General

LiveCodeBench v6

89.0%自報

IFBench

81.7%自報

LongBench v2

61.9%自報

Knowledge

OmniScience

78.7%自報

Language

WMT24++

83.7%自報

Long Context

RULER

94.7%自報

AA-LCR

65.4%自報

Math

IMO-AnswerBench

92.3%自報

Humanity's Last Exam

37.4%自報

CritPT

3.1%自報

Reasoning

Apex

84.8%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Legal

100

Finance

100

General

100

Agents

100

Reasoning

Coding

Instruction Following

Language

Healthcare

Long Context

Physics

Frontend Development

Biology

Chemistry

Structured Output

Math

Code

Communication

Tool Calling

Vision

定價

輸入價格$0.5 / 1M tokens

輸出價格$2.5 / 1M tokens

混合價格(3:1)$1 / 1M tokens

快取讀取價格$0.15 / 1M tokens

速度

暫無速度資料

供應商價格排行

4 個供應商

最便宜: NVIDIA最貴: Together AI

供應商輸入輸出

1NVIDIA主要

$0.5

$2.5

2OpenRouter

$0.5

$2.2

3Vercel AI Gateway

$0.6

$2.4

4Together AI

$0.6

$3.6

比較該模型在不同 API 供應商之間的定價。

外部連結

LLM Stats Artificial Analysis