Llama 3.1 Nemotron Instruct 70B

NVIDIALlamaOpen WeightLlama 3.1 Community License

Descripción

A large language model customized by NVIDIA to improve the helpfulness of LLM generated responses. It is a fine-tuned version of Llama 3.1 70B Instruct. The model was trained using RLHF (REINFORCE) with HelpSteer2-Preference prompts.

Fecha de lanzamiento

2024-10-15

Parámetros

70.0B

Longitud del contexto

—

Modalidades

—

Radar de capacidades

general

coding

reasoning

scienceest.

agents

multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio	#Posición	Puntuación	Fuente
Ranking de codificación	436	11.0	AA
Ranking general	378	29.0	AA
Razonamiento matemático	282	26.0	AA
Razonamiento	18	86.0	LS
Ciencia	373	30.0	AA

Puntuaciones de benchmarks (LLM Stats)

Communication

MT-Bench

0.09 / 100Aut.

Finance

MMLU Chat

80.6%Aut.

MMLU

80.2%Aut.

TruthfulQA

58.6%Aut.

General

Instruct HumanEval

73.8%Aut.

ARC-C

69.2%Aut.

Language

Winogrande

84.5%Aut.

XLSum English

31.6%Aut.

Math

GSM8k

91.4%Aut.

GSM8K Chat

81.9%Aut.

Reasoning

HellaSwag

85.6%Aut.

Índices de evaluación AA

Math Index

11.0

Intelligence Index

7.6

Math 500

0.7

Mmlu Pro

0.7

Gpqa

0.5

Ifbench

0.3

Aime

0.2

Scicode

0.2

Tau2

0.2

Livecodebench

0.2

Aime 25

0.1

Lcr

0.1

Hle

0.0

Terminalbench Hard

0.0

Puntuaciones por categoría LLM Stats

Math

Language

Legal

Reasoning

Finance

Healthcare

General

Roleplay

Communication

Creativity

Precios

Precio de entrada$1.2 / 1M tokens

Precio de salida$1.2 / 1M tokens

Precio mixto (3:1)$1.2 / 1M tokens

Velocidad

Tokens/seg295.6

Retraso del primer token4.91s

Tiempo hasta la respuesta4.91s

Ranking de Precios por Proveedor

2 proveedores

Más barato: NanoGPTMás caro: NVIDIA

ProveedorEntradaSalida

1NanoGPTMás barato

$0.357

$0.408

2NVIDIAPRINCIPAL

$1.2

Comparar precios entre diferentes proveedores de API para este modelo.

Fuentes externas

LLM Stats Artificial Analysis