NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
NVIDIA · Open Weight · NVIDIA Open Model License Agreement · Commercial OK
Description
Nemotron 3 Nano is a 31.6B hybrid MoE model optimized for fast, long‑context agentic reasoning. It mixes Mamba‑2 and Transformer layers with a sparse MoE router (~3.6B active params per token) to deliver up to 4× higher throughput than Nemotron 2 and strong accuracy across math, coding, and tools. It supports a 1M‑token context window, offers Reasoning ON/OFF and a thinking‑budget to control costs, and ships with open weights, data, and RL tooling (NeMo Gym/RL). Released Dec 15, 2025 under the NVIDIA Open Model License, it’s built as the efficient backbone for multi‑agent systems at scale.
Release date
2025-12-15
Parameters
32.0B
Context length
262K
Modalities
text
Capability radar
general: 25
coding: 24
reasoning: 18
science (est.): 27
agents: 50
multimodal: 0
Science uses a reasoning-based proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 101 | 9.0 | LS |
| Code Ranking | 307 | 24.0 | AA |
| General Ranking | 356 | 31.0 | AA |
| Math Reasoning | 329 | 13.0 | AA |
| Science | 371 | 28.0 | AA |
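Benchmark scores (LLM Stats)
| Category | Benchmark | Score | Source |
|---|---|---|---|
| Agents | Terminal-Bench | 8.5% | Self-reported |
| Biology | GPQA | 75.0% | Self-reported |
| Biology | SciCode | 33.3% | Self-reported |
| Code | SWE-Bench Verified | 38.8% | Self-reported |
| Communication | Tau2 Retail | 56.9% | Self-reported |
| Communication | Tau2 Airline | 48.0% | Self-reported |
| Communication | Tau2 Telecom | 42.2% | Self-reported |
| Communication | Multi-Challenge | 38.5% | Self-reported |
| Creativity | Arena-Hard v2 | 67.7% | Self-reported |
| Finance | MMLU-Pro | 78.3% | Self-reported |
| Finance | MMLU-ProX | 59.5% | Self-reported |
| General | LiveCodeBench v6 | 68.3% | Self-reported |
| Language | WMT24++ | 86.2% | Self-reported |
| Math | AIME 2025 | 99.2% | Self-reported |
| Math | Humanity's Last Exam | 15.5% | Self-reported |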
AA evaluation indices
Coding Index: 15.8
Math Index: 13.3
Intelligence Index: 13.2
MMLU-Pro: 0.6
GPQA: 0.4
IFBench: 0.4
LiveCodeBench: 0.4
Tau2: 0.3
SciCode: 0.2
AIME 25: 0.1
Terminal-Bench Hard: 0.1
LCR: 0.1
HLE: 0.0
LLM Stats category scores
Writing: 70
Creativity: 70
Finance: 70
General: 70
Healthcare: 70
Language: 70
Legal: 70
Math: 60
Tool Calling: 50
Biology: 50
Chemistry: 50
Communication: 50
Physics: 50
Reasoning: 50
Frontend Development: 40
Code: 30
Vision: 20
Agents: 10
Pricing
Input price: $0.05 / 1M tokens
Output price: $0.20 / 1M tokens
Blended price (3:1): $0.088 / 1M tokens
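The blended figure is consistent with a weighted average of three parts input price to one part output price. The sketch below works under that assumption (the page does not state the formula), and the function names `blended_price` and `request_cost` are hypothetical helpers introduced only for illustration.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes the "3:1" label means 3 parts input to 1 part output.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD of one request, given prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Listed prices: $0.05 input, $0.20 output per 1M tokens.
    print(f"blended: ${blended_price(0.05, 0.20):.4f} / 1M tokens")
    print(f"30K in + 10K out: ${request_cost(30_000, 10_000, 0.05, 0.20):.4f}")
```

With the listed prices the weighted average comes to $0.0875, which rounds to the $0.088 shown above.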
Speed
Throughput: 78.5 tokens/s
Time to first token: 0.25 s
Time to first response: 0.25 s
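As a rough guide, these two figures can be combined into an end-to-end estimate for a streamed response. The sketch below assumes a simple model in which total time is the time to first token plus output length divided by a constant throughput; real latency varies with provider load and prompt length, and `estimated_response_time` is a hypothetical helper.

```python
def estimated_response_time(output_tokens: int,
                            ttft_s: float = 0.25,
                            tokens_per_s: float = 78.5) -> float:
    """Approximate wall-clock seconds to receive `output_tokens` tokens.

    Simplifying assumption: constant decode throughput after the first token.
    Defaults are the speed figures listed on this page.
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return ttft_s + output_tokens / tokens_per_s


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(f"{n} tokens: about {estimated_response_time(n):.1f} s")
```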
Available providers
(LS internal units)
| Provider | Input price | Output price |
|---|---|---|
| DeepInfra | 60K | 240K |