Nemotron 3 Ultra (550B A55B)
描述
Nemotron 3 Ultra is NVIDIA's frontier-scale open model with 550B total / 55B active parameters, built for agentic reasoning, long-context analysis, tool use, and high-stakes RAG. It uses a hybrid Latent Mixture-of-Experts (LatentMoE) architecture interleaving Mamba-2, MoE, and select Attention layers, with Multi-Token Prediction (MTP) for native speculative decoding, and is pre-trained on ~20T tokens with an NVFP4 recipe. Reasoning is configurable on/off (plus a medium-effort mode) via the chat template. It supports up to a 1M-token context and 10 languages (English, French, Spanish, Italian, German, Japanese, Hindi, Korean, Brazilian Portuguese, Chinese). Released with open weights, training data, and recipes under the OpenMDW-1.1 license.
能力雷达图
Science 在缺少专门科学评测时使用推理能力代理估算。
排行榜排名
基准测试分数 (LLM Stats)
Agents
Biology
Code
Communication
Finance
General
Knowledge
Language
Long Context
Math
Reasoning
AA 评测指数
暂无 AA 评测数据
LLM Stats 分类评分
定价
速度
暂无速度数据
供应商价格排行
供应商价格排行
4 个供应商
比较该模型在不同 API 供应商之间的定价。