NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
Description
Nemotron 3 Nano is a 31.6B hybrid MoE model optimized for fast, long‑context agentic reasoning. It mixes Mamba‑2 and Transformer layers with a sparse MoE router (~3.6B active params per token) to deliver up to 4× higher throughput than Nemotron 2 and strong accuracy across math, coding, and tools. It supports a 1M‑token context window, offers Reasoning ON/OFF and a thinking‑budget to control costs, and ships with open weights, data, and RL tooling (NeMo Gym/RL). Released Dec 15, 2025 under the NVIDIA Open Model License, it’s built as the efficient backbone for multi‑agent systems at scale.
Capability Radar
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 191 | 49.0 | AA |
| General Ranking | 167 | 54.0 | AA |
| Math Reasoning | 29 | 92.0 | AA |
| Science | 202 | 48.0 | AA |
Benchmark Scores (LLM Stats)
Agents
Biology
Code
Communication
Creativity
Finance
General
Language
Math
AA Evaluation Indices
LLM Stats Category Scores
Pricing
Speed
Provider Price Ranking
Provider Price Ranking
3 providers
Compare pricing across different API providers for this model.