NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)

NVIDIA · Open Weight · NVIDIA Open Model License Agreement · Commercial Use

Description

Nemotron 3 Nano is a 31.6B hybrid MoE model optimized for fast, long‑context agentic reasoning. It mixes Mamba‑2 and Transformer layers with a sparse MoE router (~3.6B active params per token) to deliver up to 4× higher throughput than Nemotron 2 and strong accuracy across math, coding, and tools. It supports a 1M‑token context window, offers Reasoning ON/OFF and a thinking‑budget to control costs, and ships with open weights, data, and RL tooling (NeMo Gym/RL). Released Dec 15, 2025 under the NVIDIA Open Model License, it’s built as the efficient backbone for multi‑agent systems at scale.

Release date

2025-12-15

Parameters

32.0B

Context length

131K

Modalities

text

Capability radar

Axes: general, coding, reasoning, science (est.), agents, multimodal.

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	Rank	Score	Source
Agentic capability	124	9.0	LS
Coding ranking	348	22.0	AA
Overall ranking	379	29.0	AA
Mathematical reasoning	329	13.0	AA
Science	396	27.0	AA

Benchmark scores (LLM Stats)

Category	Benchmark	Score
Agents	Terminal-Bench	8.5% (Aut.)
Biology	GPQA	75.0% (Aut.)
Biology	SciCode	33.3% (Aut.)
Code	SWE-Bench Verified	38.8% (Aut.)
Communication	Tau2 Retail	56.9% (Aut.)
Communication	Tau2 Airline	48.0% (Aut.)
Communication	Tau2 Telecom	42.2% (Aut.)
Communication	Multi-Challenge	38.5% (Aut.)
Creativity	Arena-Hard v2	67.7% (Aut.)
Finance	MMLU-Pro	78.3% (Aut.)
Finance	MMLU-ProX	59.5% (Aut.)
General	LiveCodeBench v6	68.3% (Aut.)
Language	WMT24++	86.2% (Aut.)
Math	AIME 2025	99.2% (Aut.)
Math	Humanity's Last Exam	15.5% (Aut.)

AA evaluation indices

Index	Score
Math Index	13.3
Intelligence Index	7.4
MMLU-Pro	0.6
GPQA	0.4
IFBench	0.4
LiveCodeBench	0.4
Tau2	0.3
SciCode	0.2
AIME 25	0.1
Terminal-Bench Hard	0.1
LCR	0.1
HLE	0.0

LLM Stats category scores

Categories: Language, Legal, Finance, General, Healthcare, Creativity, Writing, Math, Physics, Reasoning, Biology, Chemistry, Communication, Tool Calling, Frontend Development, Code, Vision, Agents.

Pricing

Input price: $0.05 / 1M tokens

Output price: $0.20 / 1M tokens

Blended price (3:1): $0.088 / 1M tokens
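The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token and that the blend is a simple weighted average; the page does not spell this out, but the assumption matches the listed value after rounding. The function name `blended_price` is illustrative only.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output token mix.

    Assumption: the 3:1 ratio on the page is input tokens to output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Listed prices in USD per 1M tokens.
price = blended_price(0.05, 0.20)
print(f"${price:.4f} / 1M tokens")  # (3 * 0.05 + 0.20) / 4 = 0.0875
```

Under this reading the result is $0.0875 per 1M tokens, which is consistent with the $0.088 shown once rounded to three decimals.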

Speed

Tokens/sec: 96.6

First-token latency: 0.27 s

Time to response: 0.27 s
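As a rough guide to what these figures mean for a full completion, the sketch below adds the first-token latency to the time needed to stream the remaining output at the listed throughput. This is a simplified linear model and an assumption, not something the page states: real latency depends on provider, load, and prompt length. The function name `estimated_response_seconds` is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               first_token_latency_s: float = 0.27,
                               tokens_per_second: float = 96.6) -> float:
    """Estimate wall-clock time for a completion of `output_tokens` tokens.

    Assumes a constant decoding rate after the first token arrives.
    """
    return first_token_latency_s + output_tokens / tokens_per_second


for n in (100, 500, 2000):
    print(f"{n} tokens: ~{estimated_response_seconds(n):.1f} s")
```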

Provider price ranking

6 providers

Cheapest: DeepInfra · Most expensive: NanoGPT

Rank	Provider	Input (per 1M tokens)	Output (per 1M tokens)
1	DeepInfra (cheapest)		
2	NVIDIA (primary)	$0.05	$0.20
3	OpenRouter	$0.05	$0.20
4	Kilo Gateway	$0.05	$0.20
5	Vercel AI Gateway	$0.05	$0.24
6	NanoGPT	$0.17	$0.68

Compare prices across different API providers for this model.
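To see how the provider prices translate into the cost of a concrete workload, the sketch below prices a hypothetical job of 3M input tokens and 1M output tokens at each provider. It assumes the provider prices are quoted per 1M tokens, like the headline prices. DeepInfra is left out because its prices do not appear in the listing. The workload size and the names `PROVIDERS` and `workload_cost` are illustrative.

```python
# (input, output) prices in USD per 1M tokens, copied from the listing.
# DeepInfra is omitted: the listing does not show its prices.
PROVIDERS = {
    "NVIDIA": (0.05, 0.20),
    "OpenRouter": (0.05, 0.20),
    "Kilo Gateway": (0.05, 0.20),
    "Vercel AI Gateway": (0.05, 0.24),
    "NanoGPT": (0.17, 0.68),
}


def workload_cost(input_tokens: int, output_tokens: int,
                  input_price: float, output_price: float) -> float:
    """Cost in USD of a workload, given prices per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 3M input tokens, 1M output tokens.
costs = {name: workload_cost(3_000_000, 1_000_000, *prices)
         for name, prices in PROVIDERS.items()}

for name, cost in sorted(costs.items(), key=lambda item: item[1]):
    print(f"{name}: ${cost:.2f}")
```

For this mix the three providers charging $0.05 / $0.20 come out equal, so any difference between them would come from factors the listing does not cover, such as throughput or rate limits.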

External sources

LLM Stats · Artificial Analysis