NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
Description
Nemotron 3 Nano is a 31.6B-parameter hybrid MoE model optimized for fast, long-context agentic reasoning. It combines Mamba-2 and Transformer layers with a sparse MoE router that activates about 3.6B parameters per token, delivering up to 4× higher throughput than Nemotron 2 and strong accuracy on math, coding, and tool use. It supports a 1M-token context window and offers a Reasoning ON/OFF switch plus a thinking budget to control costs. It ships with open weights, data, and RL tooling (NeMo Gym/RL). Released Dec 15, 2025 under the NVIDIA Open Model License, it’s built as the efficient backbone for multi-agent systems at scale.
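To make the "sparse MoE router" idea concrete, here is a minimal, generic sketch of top-k expert routing in plain Python. It is not NVIDIA's implementation: the function name `top_k_route`, the 8 experts, `k=2`, and the router scores are all hypothetical values chosen for illustration. Only the two parameter counts at the end come from the description above.

```python
import math

def top_k_route(logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    # Pick the indices of the k largest router scores.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax over the selected experts only, so their weights sum to 1.
    m = max(logits[i] for i in top)
    exps = [math.exp(logits[i] - m) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

# Hypothetical router scores for one token over 8 experts (assumed values).
router_logits = [0.1, 2.0, -1.0, 0.5, 1.5, -0.3, 0.0, 0.9]
routes = top_k_route(router_logits, k=2)

# Parameter counts quoted in the description (in billions).
total_params_b = 31.6
active_params_b = 3.6
active_fraction = active_params_b / total_params_b

print("selected experts:", [i for i, _ in routes])
print(f"active share of parameters: {active_fraction:.1%}")
```

Because each token runs through only the selected experts, the per-token compute tracks the active parameter count rather than the total, which is the source of the throughput advantage the description claims.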
Capability Radar
The Science axis uses a reasoning score as a proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agentic Capability | 124 | 9.0 | LS |
| Code Ranking | 348 | 22.0 | AA |
| General Ranking | 379 | 29.0 | AA |
| Math Reasoning | 329 | 13.0 | AA |
| Science | 396 | 27.0 | AA |
Benchmark Scores (LLM Stats)
Categories: Agents, Biology, Code, Communication, Creativity, Finance, General, Language, Math
AA Evaluation Indices
LLM Stats Category Scores
Pricing
Speed
Provider Price Ranking
6 providers
Compare pricing across different API providers for this model.