Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)

NVIDIA, Llama, open weights, Llama 3.1 Community License

Description

A 253B-parameter derivative of Meta Llama 3.1 405B Instruct, developed by NVIDIA using Neural Architecture Search (NAS) and vertical compression. It underwent multi-phase post-training (SFT for math, code, reasoning, chat, and tool calling, plus RL with GRPO) to improve reasoning and instruction following. It is optimized for the accuracy/efficiency tradeoff on NVIDIA GPUs and supports a 128k context.

Release date

2025-04-07

Parameters

253.0B

Context length

—

Modality

—

Capability radar

Axes: general, coding, reasoning, science (estimated), agents, multimodal

When specialized science benchmarks are unavailable, the Science score is estimated using a reasoning proxy.

Benchmark scores (LLM Stats)

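Category	Benchmark	Score	Source
Biology	GPQA	76.0%	Self-reported
Code	LiveCodeBench	66.3%	Self-reported
General	IFEval	89.5%	Self-reported
General	BFCL v2	74.1%	Self-reported
Math	MATH-500	97.0%	Self-reported
Math	AIME 2025	72.5%	Self-reported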

AA evaluation indices

Metric	Value
Math Index	63.7
Intelligence Index	9.1
MATH-500	1.0
MMLU-Pro	0.8
AIME	0.7
GPQA	0.7
LiveCodeBench	0.6
AIME 2025	0.6
IFBench	0.4
SciCode	0.3
Tau2	0.1
HLE	0.1
LCR	0.1
Terminal-Bench Hard	0.0

LLM Stats category scores

Categories: Instruction Following, Structured Output, Math, Physics, Reasoning, General, Biology, Chemistry, Code, Tool Calling

Pricing

Input price: $0.6 / 1M tokens

Output price: $1.8 / 1M tokens

Blended price (3:1 input to output): $0.9 / 1M tokens
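
The blended figure is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. The sketch below reproduces that arithmetic and extends it to a per-request cost estimate. The function names `blended_price` and `request_cost` are hypothetical helpers introduced for illustration, and the weighted-average formula is an assumption about how the listed figure was derived.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for an input:output token ratio.

    Assumption: the blended price is a plain weighted mean of the two prices.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD of one request, given prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Listed prices: $0.6 input and $1.8 output per 1M tokens.
    print(f"{blended_price(0.6, 1.8):.2f}")  # 0.90
    # Hypothetical request: 3,000 prompt tokens and 1,000 generated tokens.
    print(f"{request_cost(3_000, 1_000, 0.6, 1.8):.4f}")
```

For a reasoning model, the generated token count includes any reasoning tokens billed as output, so real workloads may skew further toward output than the 3:1 ratio assumes.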

Speed

Tokens per second: 52.2

Time to first token: 0.70 s

Time to first answer: 39.03 s
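
The gap between the first token and the first answer suggests a long reasoning phase before the final answer begins. As a rough sketch, the listed speed figures can be combined to estimate how many tokens are generated during that phase and how long a full response might take. This assumes constant throughput at the listed rate and that "first answer" means the first token of the final answer after reasoning; neither is confirmed by the page, and the function names are hypothetical.

```python
TOKENS_PER_SECOND = 52.2        # listed output speed
TIME_TO_FIRST_TOKEN_S = 0.70    # listed time to first token
TIME_TO_FIRST_ANSWER_S = 39.03  # listed time to first answer


def estimated_reasoning_tokens(ttft_s: float, ttfa_s: float,
                               tokens_per_s: float) -> float:
    """Tokens streamed between the first token and the first answer token.

    Assumption: throughput is constant over the whole reasoning phase.
    """
    return (ttfa_s - ttft_s) * tokens_per_s


def estimated_total_time(answer_tokens: int, ttfa_s: float,
                         tokens_per_s: float) -> float:
    """Seconds until an answer of `answer_tokens` tokens finishes streaming."""
    return ttfa_s + answer_tokens / tokens_per_s


if __name__ == "__main__":
    tokens = estimated_reasoning_tokens(
        TIME_TO_FIRST_TOKEN_S, TIME_TO_FIRST_ANSWER_S, TOKENS_PER_SECOND)
    print(f"~{tokens:.0f} reasoning tokens before the answer starts")
    # Hypothetical 500-token answer.
    total = estimated_total_time(500, TIME_TO_FIRST_ANSWER_S, TOKENS_PER_SECOND)
    print(f"~{total:.1f} s for a 500-token answer")
```

These numbers are averages from a benchmark harness, so they are better read as an order-of-magnitude guide than as a latency guarantee.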

Provider price ranking

3 providers

Cheapest: NVIDIA. Most expensive: LLM Gateway.

#	Provider	Input	Output
1	NVIDIA (primary)	$0.6	$1.8
2	Nebius Token Factory	$0.6	$1.8
3	LLM Gateway	$0.6	$1.8

Compares prices for this model across different API providers.

External links

LLM Stats, Artificial Analysis

Domain	Rank	Score	Source
Coding ranking	307	28.0	AA
Overall ranking	314	34.0	AA
Mathematical reasoning	108	73.0	AA
Science	192	49.0	AA