NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
NVIDIA · Open Weight · NVIDIA Open Model License Agreement · Commercial OK
Description
Nemotron 3 Nano is a 31.6B hybrid MoE model optimized for fast, long‑context agentic reasoning. It mixes Mamba‑2 and Transformer layers with a sparse MoE router (~3.6B active params per token) to deliver up to 4× higher throughput than Nemotron 2 and strong accuracy across math, coding, and tools. It supports a 1M‑token context window, offers Reasoning ON/OFF and a thinking‑budget to control costs, and ships with open weights, data, and RL tooling (NeMo Gym/RL). Released Dec 15, 2025 under the NVIDIA Open Model License, it’s built as the efficient backbone for multi‑agent systems at scale.
Release date
2025-12-15
Parameters
32.0B
Context length
262K
Modalities
text
Capability radar
| Capability | Score |
|---|---|
| General | 25 |
| Coding | 24 |
| Reasoning | 18 |
| Science (est.) | 27 |
| Agents | 50 |
| Multimodal | 0 |

Science uses a reasoning proxy when dedicated science benchmarks are not available.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 101 | 9.0 | LS |
| Code Ranking | 307 | 24.0 | AA |
| General Ranking | 356 | 31.0 | AA |
| Math Reasoning | 329 | 13.0 | AA |
| Science | 371 | 28.0 | AA |
Benchmark scores (LLM Stats)
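| Category | Benchmark | Score |
|---|---|---|
| Agents | Terminal-Bench | 8.5% (Aut.) |
| Biology | GPQA | 75.0% (Aut.) |
| Biology | SciCode | 33.3% (Aut.) |
| Code | SWE-Bench Verified | 38.8% (Aut.) |
| Communication | Tau2 Retail | 56.9% (Aut.) |
| Communication | Tau2 Airline | 48.0% (Aut.) |
| Communication | Tau2 Telecom | 42.2% (Aut.) |
| Communication | Multi-Challenge | 38.5% (Aut.) |
| Creativity | Arena-Hard v2 | 67.7% (Aut.) |
| Finance | MMLU-Pro | 78.3% (Aut.) |
| Finance | MMLU-ProX | 59.5% (Aut.) |
| General | LiveCodeBench v6 | 68.3% (Aut.) |
| Language | WMT24++ | 86.2% (Aut.) |
| Math | AIME 2025 | 99.2% (Aut.) |
| Math | Humanity's Last Exam | 15.5% (Aut.) |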
AA evaluation indices
Coding Index: 15.8
Math Index: 13.3
Intelligence Index: 13.2
MMLU-Pro: 0.6
GPQA: 0.4
IFBench: 0.4
LiveCodeBench: 0.4
Tau2: 0.3
SciCode: 0.2
AIME 25: 0.1
Terminal-Bench Hard: 0.1
LCR: 0.1
HLE: 0.0
LLM Stats category scores
Writing: 70
Creativity: 70
Finance: 70
General: 70
Healthcare: 70
Language: 70
Legal: 70
Math: 60
Tool Calling: 50
Biology: 50
Chemistry: 50
Communication: 50
Physics: 50
Reasoning: 50
Frontend Development: 40
Code: 30
Vision: 20
Agents: 10
Pricing
Input price: $0.05 / 1M tokens
Output price: $0.2 / 1M tokens
Blended price (3:1): $0.088 / 1M tokens
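The 3:1 blended figure is consistent with a weighted average of three input tokens for every output token. The sketch below reproduces that arithmetic from the listed input and output prices. The function name `blended_price` and the weighting interpretation are assumptions for illustration, not something the listing documents.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens for an input:output token ratio.

    Assumption: "3:1" means three input tokens per output token.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices per 1M tokens as listed on this page.
price = blended_price(0.05, 0.20)
print(f"${price:.4f} / 1M tokens")
```

Under this assumption the result is $0.0875 per 1M tokens, which matches the listed $0.088 once rounded to three decimals.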
Speed
Tokens/sec: 78.5 tokens/s
Time to first token: 0.25 s
Time to response: 0.25 s
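As a rough guide, the speed figures can be combined into an end-to-end estimate: the first-token delay plus the output length divided by throughput. This is a back-of-the-envelope sketch that assumes steady throughput and ignores queueing or network overhead. The function name `estimated_response_seconds` is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 78.5,
                               first_token_delay: float = 0.25) -> float:
    """Estimate total generation time in seconds.

    Assumption: throughput stays constant at the listed rate once
    the first token has arrived.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_token_delay + output_tokens / tokens_per_second


# Example: a 785-token reply at the listed speed.
print(f"{estimated_response_seconds(785):.2f} s")
```

For a 785-token reply this gives about 10.25 s; actual latency depends on the provider and load.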
Available providers
(LS internal units)
| Provider | Input price | Output price |
|---|---|---|
| DeepInfra | 60K | 240K |