Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)

NVIDIALlamaOpen WeightLlama 3.1 Community License

Descripción

A 253B parameter derivative of Meta Llama 3.1 405B Instruct, developed by NVIDIA using Neural Architecture Search (NAS) and vertical compression. It underwent multi-phase post-training (SFT for Math, Code, Reasoning, Chat, Tool Calling; RL with GRPO) to enhance reasoning and instruction-following. Optimized for accuracy/efficiency tradeoff on NVIDIA GPUs. Supports 128k context.

Fecha de lanzamiento

2025-04-07

Parámetros

253.0B

Longitud del contexto

—

Modalidades

—

Radar de capacidades

general

coding

reasoning

scienceest.

agents

multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio	#Posición	Puntuación	Fuente
Ranking de codificación	307	28.0	AA
Ranking general	314	34.0	AA
Razonamiento matemático	108	73.0	AA
Ciencia	192	49.0	AA

Puntuaciones de benchmarks (LLM Stats)

Biology

GPQA

76.0%Aut.

Code

LiveCodeBench

66.3%Aut.

General

IFEval

89.5%Aut.

BFCL v2

74.1%Aut.

Math

MATH-500

97.0%Aut.

AIME 2025

72.5%Aut.

Índices de evaluación AA

Math Index

63.7

Intelligence Index

9.1

Math 500

1.0

Mmlu Pro

0.8

Aime

0.7

Gpqa

0.7

Livecodebench

0.6

Aime 25

0.6

Ifbench

0.4

Scicode

0.3

Tau2

0.1

Hle

0.1

Lcr

0.1

Terminalbench Hard

0.0

Puntuaciones por categoría LLM Stats

Instruction Following

Structured Output

Math

Physics

Reasoning

General

Biology

Chemistry

Code

Tool Calling

Precios

Precio de entrada$0.6 / 1M tokens

Precio de salida$1.8 / 1M tokens

Precio mixto (3:1)$0.9 / 1M tokens

Velocidad

Tokens/seg52.2

Retraso del primer token0.70s

Tiempo hasta la respuesta39.03s

Ranking de Precios por Proveedor

3 proveedores

Más barato: NVIDIAMás caro: LLM Gateway

ProveedorEntradaSalida

1NVIDIAPRINCIPAL

$0.6

$1.8

2Nebius Token Factory

$0.6

$1.8

3LLM Gateway

$0.6

$1.8

Comparar precios entre diferentes proveedores de API para este modelo.

Fuentes externas

LLM Stats Artificial Analysis