Nemotron 3 Ultra (550B A55B)

NVIDIAオープンウエイトOpenMDW License v1.1 · 商用利用可

説明

Nemotron 3 Ultra is NVIDIA's frontier-scale open model with 550B total / 55B active parameters, built for agentic reasoning, long-context analysis, tool use, and high-stakes RAG. It uses a hybrid Latent Mixture-of-Experts (LatentMoE) architecture interleaving Mamba-2, MoE, and select Attention layers, with Multi-Token Prediction (MTP) for native speculative decoding, and is pre-trained on ~20T tokens with an NVFP4 recipe. Reasoning is configurable on/off (plus a medium-effort mode) via the chat template. It supports up to a 1M-token context and 10 languages (English, French, Spanish, Italian, German, Japanese, Hindi, Korean, Brazilian Portuguese, Chinese). Released with open weights, training data, and recipes under the OpenMDW-1.1 license.

リリース日

2026-06-04

パラメータ

550.0B

コンテキスト長

1.0M

モダリティ

text

能力レーダー

100

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Agents

GDPval-AA

1183.00 / 3000自己申告

PinchBench

90.0%自己申告

Terminal-Bench 2.1

56.4%自己申告

ProfBench

56.0%自己申告

Finance Agent

53.7%自己申告

GDPval

46.7%自己申告

BrowseComp

44.4%自己申告

Finance Agent v2

37.5%自己申告

TAU3-Bench

22.6%自己申告

Biology

GPQA

87.0%自己申告

SciCode

44.6%自己申告

Code

SWE-Bench Verified

70.7%自己申告

SWE-bench Multilingual

67.7%自己申告

Communication

Multi-Challenge

63.8%自己申告

Finance

MMLU-Pro

86.8%自己申告

MMLU-ProX

83.0%自己申告

General

LiveCodeBench v6

89.0%自己申告

IFBench

81.7%自己申告

LongBench v2

61.9%自己申告

Knowledge

OmniScience

78.7%自己申告

Language

WMT24++

83.7%自己申告

Long Context

RULER

94.7%自己申告

AA-LCR

65.4%自己申告

Math

IMO-AnswerBench

92.3%自己申告

Humanity's Last Exam

37.4%自己申告

CritPT

3.1%自己申告

Reasoning

Apex

84.8%自己申告

AA評価指数

AA評価データがありません

LLM Statsカテゴリスコア

Legal

100

Finance

100

General

100

Agents

100

Reasoning

Coding

Instruction Following

Language

Healthcare

Long Context

Physics

Frontend Development

Biology

Chemistry

Structured Output

Math

Code

Communication

Tool Calling

Vision

価格設定

入力価格$0.5 / 1Mトークン

出力価格$2.5 / 1Mトークン

混合価格（3:1）$1 / 1Mトークン

キャッシュ読み取り価格$0.15 / 1Mトークン

速度

速度データがありません

プロバイダー価格ランキング

4 プロバイダー

最安: NVIDIA最高: Together AI

プロバイダー入力出力

1NVIDIAプライマリ

$0.5

$2.5

2OpenRouter

$0.5

$2.2

3Vercel AI Gateway

$0.6

$2.4

4Together AI

$0.6

$3.6

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
エージェント能力	93	48.0	LS
推論	21	85.0	LS