Llama 3.1 Nemotron Instruct 70B

NVIDIALlama开源权重Llama 3.1 Community License

描述

A large language model customized by NVIDIA to improve the helpfulness of LLM generated responses. It is a fine-tuned version of Llama 3.1 70B Instruct. The model was trained using RLHF (REINFORCE) with HelpSteer2-Preference prompts.

发布日期

2024-10-15

参数规模

70.0B

上下文长度

—

支持模态

—

能力雷达图

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域	#排名	分数	来源
代码能力榜	436	11.0	AA
通用能力榜	378	29.0	AA
数学推理	282	26.0	AA
推理能力	18	86.0	LS
科学能力	373	30.0	AA

基准测试分数 (LLM Stats)

Communication

MT-Bench

0.09 / 100自报

Finance

MMLU Chat

80.6%自报

MMLU

80.2%自报

TruthfulQA

58.6%自报

General

Instruct HumanEval

73.8%自报

ARC-C

69.2%自报

Language

Winogrande

84.5%自报

XLSum English

31.6%自报

Math

GSM8k

91.4%自报

GSM8K Chat

81.9%自报

Reasoning

HellaSwag

85.6%自报

AA 评测指数

Math Index

11.0

Intelligence Index

7.6

Math 500

0.7

Mmlu Pro

0.7

Gpqa

0.5

Ifbench

0.3

Aime

0.2

Scicode

0.2

Tau2

0.2

Livecodebench

0.2

Aime 25

0.1

Lcr

0.1

Hle

0.0

Terminalbench Hard

0.0

LLM Stats 分类评分

Math

Language

Legal

Reasoning

Finance

Healthcare

General

Roleplay

Communication

Creativity

定价

输入价格$1.2 / 1M tokens

输出价格$1.2 / 1M tokens

混合价格(3:1)$1.2 / 1M tokens

速度

Tokens/秒295.6

首Token延迟4.91s

首回答延迟4.91s

供应商价格排行

2 个供应商

最便宜: NanoGPT最贵: NVIDIA

供应商输入输出

1NanoGPT最便宜

$0.357

$0.408

2NVIDIA主要

$1.2

比较该模型在不同 API 供应商之间的定价。

外部链接

LLM Stats Artificial Analysis