Nemotron 3 Ultra (550B A55B)

NVIDIA오픈 웨이트OpenMDW License v1.1 · 상업적 사용 가능

설명

Nemotron 3 Ultra is NVIDIA's frontier-scale open model with 550B total / 55B active parameters, built for agentic reasoning, long-context analysis, tool use, and high-stakes RAG. It uses a hybrid Latent Mixture-of-Experts (LatentMoE) architecture interleaving Mamba-2, MoE, and select Attention layers, with Multi-Token Prediction (MTP) for native speculative decoding, and is pre-trained on ~20T tokens with an NVFP4 recipe. Reasoning is configurable on/off (plus a medium-effort mode) via the chat template. It supports up to a 1M-token context and 10 languages (English, French, Spanish, Italian, German, Japanese, Hindi, Korean, Brazilian Portuguese, Chinese). Released with open weights, training data, and recipes under the OpenMDW-1.1 license.

출시일

2026-06-04

파라미터

550.0B

컨텍스트 길이

1.0M

모달리티

text

능력 레이더

100

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
에이전트형 역량	93	48.0	LS
추론	21	85.0	LS

벤치마크 점수 (LLM Stats)

Agents

GDPval-AA

1183.00 / 3000자체 보고

PinchBench

90.0%자체 보고

Terminal-Bench 2.1

56.4%자체 보고

ProfBench

56.0%자체 보고

Finance Agent

53.7%자체 보고

GDPval

46.7%자체 보고

BrowseComp

44.4%자체 보고

Finance Agent v2

37.5%자체 보고

TAU3-Bench

22.6%자체 보고

Biology

GPQA

87.0%자체 보고

SciCode

44.6%자체 보고

Code

SWE-Bench Verified

70.7%자체 보고

SWE-bench Multilingual

67.7%자체 보고

Communication

Multi-Challenge

63.8%자체 보고

Finance

MMLU-Pro

86.8%자체 보고

MMLU-ProX

83.0%자체 보고

General

LiveCodeBench v6

89.0%자체 보고

IFBench

81.7%자체 보고

LongBench v2

61.9%자체 보고

Knowledge

OmniScience

78.7%자체 보고

Language

WMT24++

83.7%자체 보고

Long Context

RULER

94.7%자체 보고

AA-LCR

65.4%자체 보고

Math

IMO-AnswerBench

92.3%자체 보고

Humanity's Last Exam

37.4%자체 보고

CritPT

3.1%자체 보고

Reasoning

Apex

84.8%자체 보고

AA 평가 지수

AA 평가 데이터가 없습니다

LLM Stats 카테고리 점수

Legal

100

Finance

100

General

100

Agents

100

Reasoning

Coding

Instruction Following

Language

Healthcare

Long Context

Physics

Frontend Development

Biology

Chemistry

Structured Output

Math

Code

Communication

Tool Calling

Vision

가격

입력 가격$0.5 / 1M 토큰

출력 가격$2.5 / 1M 토큰

혼합 가격 (3:1)$1 / 1M 토큰

캐시 읽기 가격$0.15 / 1M 토큰

속도

속도 데이터가 없습니다

공급자 가격 순위

4개 공급자

최저가: NVIDIA최고가: Together AI

공급자입력출력

1NVIDIA주요

$0.5

$2.5

2OpenRouter

$0.5

$2.5

3Vercel AI Gateway

$0.6

$2.4

4Together AI

$0.6

$3.6

이 모델의 다양한 API 공급자 간 가격 비교.

외부 링크

LLM Stats Artificial Analysis