Hermes 3 - Llama-3.1 70B

Nous Research · Llama · Open weights · Apache 2.0 · Commercial use permitted

Description

Hermes 3 70B is Nous Research's flagship instruction-following model, fine-tuned for advanced reasoning, creative writing, and complex task completion. It features exceptional instruction adherence and strong performance across multiple domains.

Release date

2024-08-15

Parameters

70.0B

Context length

131K

Modality

text

Capability radar

Axes: general, coding, reasoning, science (estimated), agents, multimodal

When no dedicated science benchmark is available, the Science score is estimated using reasoning as a proxy.

Benchmark scores (LLM Stats)

Category	Benchmark	Score (self-reported)
Biology	GPQA	66.1%
Communication	MT-Bench	8.99 / 10
Finance	MMLU	79.1%
Finance	TruthfulQA	63.3%
Finance	MMLU-Pro	47.2%
General	PIQA	84.4%
General	ARC-E	83.0%
General	IFBench	81.2%
General	ARC-C	65.5%
General	AGIEval	56.2%
General	OpenBookQA	49.4%
Language	BoolQ	88.0%
Language	Winogrande	83.2%
Language	BBH	67.8%
Math	MATH	20.8%
Reasoning	HellaSwag	88.2%
Reasoning	MuSR	50.7%

Artificial Analysis (AA) evaluation index

Metric	Score
Intelligence Index	5.1
MMLU-Pro	0.6
MATH-500	0.5
GPQA	0.4
SciCode	0.2
LiveCodeBench	0.2
HLE	0.0
AIME	0.0

LLM Stats category scores

Categories: Roleplay, Communication, Creativity, General, Reasoning, Instruction Following, Physics, Language, Biology, Chemistry, Legal, Finance, Healthcare, Math

Pricing

Input price: $0.3 / 1M tokens

Output price: $0.3 / 1M tokens

Blended price (3:1): $0.3 / 1M tokens
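The page does not define how the 3:1 blended price is computed. A common reading, assumed here, is a weighted average with three input tokens for every output token. The sketch below shows that calculation along with a per-request cost estimate; the function names `blended_price` and `request_cost` are hypothetical and used for illustration only.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens, assuming `ratio` input tokens per output token."""
    return (ratio * input_price + output_price) / (ratio + 1.0)


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD of one request, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Prices listed on this page: $0.3 per 1M tokens for both input and output.
    print(blended_price(0.3, 0.3))
    print(request_cost(3_000, 1_000, 0.3, 0.3))
```

Because the input and output prices are identical here, the blended figure equals $0.3 whatever ratio is assumed; the ratio only matters for providers that price the two directions differently.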

Speed

Tokens per second: 30.1

Time to first token: 0.35 s

Time to first answer: 0.35 s
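As a rough guide to what these figures mean in practice, the sketch below estimates end-to-end response time as first-token latency plus generation time. It assumes throughput stays constant at the listed rate, which real deployments may not sustain; the function name `estimated_response_seconds` is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 30.1,
                               first_token_latency: float = 0.35) -> float:
    """Estimate total response time: latency to first token plus steady-state generation."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_token_latency + output_tokens / tokens_per_second


if __name__ == "__main__":
    # A 301-token reply at the listed speed and latency.
    print(round(estimated_response_seconds(301), 2))
```

Under these assumptions, a 301-token reply would take roughly 10.35 seconds.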

Provider price ranking

4 providers

Cheapest: Nous Research · Most expensive: OpenRouter

Rank	Provider	Listed price
1	Nous Research (primary)	$0.3
2	Kilo Gateway	$0.3
3	NanoGPT	$0.408
4	OpenRouter	$0.7

Compare prices for this model across different API providers.

External links

LLM Stats · Artificial Analysis

Domain	Rank	Score	Source
Coding	371	20.0	AA
Overall	413	25.0	AA
Mathematical reasoning	279	27.0	AA
Reasoning	48	70.0	LS
Science	401	27.0	AA