Granite 3.3 8B (Non-reasoning)

IBM · Open weights · Apache 2.0 · Commercial use permitted

Description

Granite-3.3-8B-Base is a decoder-only language model with a 128K-token context window. It improves on Granite-3.1-8B-Base by adding support for Fill-in-the-Middle (FIM) through specialized tokens, so the model can generate content conditioned on both a prefix and a suffix. This makes it well suited to code completion tasks.
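To make the FIM idea concrete, here is a minimal sketch of how a prefix and a suffix might be arranged into a single prompt. The sentinel strings `<fim_prefix>`, `<fim_suffix>`, and `<fim_middle>` and the prefix-suffix-middle ordering are assumptions borrowed from a common FIM convention; this page does not name the model's actual special tokens, so check the tokenizer configuration before relying on them. The helper names are hypothetical.

```python
# ASSUMPTION: these sentinel strings are illustrative placeholders only.
# The page says FIM uses "specialized tokens" but does not list them.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Lay out prefix and suffix so the model is asked to produce the middle.

    Uses the prefix-suffix-middle ordering (an assumed convention).
    """
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


def splice_completion(prefix: str, middle: str, suffix: str) -> str:
    """Reassemble the full text once the model has generated the middle."""
    return prefix + middle + suffix


if __name__ == "__main__":
    prefix = "def add(a, b):\n    return "
    suffix = "\n\nprint(add(1, 2))\n"
    print(build_fim_prompt(prefix, suffix))
    # A generated middle such as "a + b" would be spliced back like this:
    print(splice_completion(prefix, "a + b", suffix))
```

The prompt string would be sent to the model as ordinary text; whatever it generates after the final sentinel is the candidate middle.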

Release date

2025-04-16

Parameters

8.2B

Context length

—

Modality

text

Capability radar

Axes: general, coding, reasoning, science (estimated), agents, multimodal.

When no dedicated science benchmark is available, the Science score is estimated using reasoning as a proxy.

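Benchmark scores (LLM Stats)

Category	Benchmark	Score	Source
Code	HumanEval	89.7%	Self-reported
Creativity	AlpacaEval 2.0	62.7%	Self-reported
Creativity	Arena Hard	57.6%	Self-reported
Finance	MMLU	63.9%	Self-reported
Finance	TruthfulQA	52.1%	Self-reported
General	TriviaQA	78.2%	Self-reported
General	IFEval	74.8%	Self-reported
General	ARC-C	50.8%	Self-reported
General	AGIEval	49.3%	Self-reported
General	(no name given)	36.5%	Self-reported
General	PopQA	26.2%	Self-reported
Language	Winogrande	74.4%	Self-reported
Language	BIG-Bench Hard	69.1%	Self-reported
Math	AIME 2024	81.2%	Self-reported
Math	MATH-500	69.0%	Self-reported
Math	GSM8k	59.0%	Self-reported
Math	DROP	36.1%	Self-reported
Reasoning	HumanEval+	86.1%	Self-reported
Reasoning	HellaSwag	80.1%	Self-reported
Safety	AttaQ	88.5%	Self-reported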

AA evaluation index

Metric	Score
Math Index	6.7
Intelligence Index	1.8
MATH-500	0.7
MMLU-Pro	0.5
GPQA	0.3
IFBench	0.2
LiveCodeBench	0.1
Tau2	0.1
SciCode	0.1
AIME 25	0.1
AIME	0.0
LCR	0.0
HLE	0.0
TerminalBench Hard	0.0

LLM Stats category scores

Categories: Safety, Code, Instruction Following, Language, Structured Output, Legal, Math, Reasoning, Finance, General, Healthcare, Creativity, Writing.

Pricing

Input price: $0.03 / 1M tokens

Output price: $0.25 / 1M tokens

Blended price (3:1 input:output): $0.085 / 1M tokens
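The blended figure is consistent with a token-weighted average of the two listed prices at three input tokens per output token. The sketch below reproduces that arithmetic and extends it to a per-request cost estimate. The function names are hypothetical, and the weighting is an assumption inferred from the 3:1 label rather than a documented formula.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = 0.03, output_price: float = 0.25) -> float:
    """Cost in dollars of one request, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # (3 * 0.03 + 1 * 0.25) / 4 matches the listed blended price of $0.085.
    print(f"Blended: ${blended_price(0.03, 0.25):.3f} / 1M tokens")
    # Example: a request with 3,000 input tokens and 1,000 output tokens.
    print(f"Request: ${request_cost(3_000, 1_000):.6f}")
```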

Speed

Tokens per second: 369.4

Time to first token: 21.86 s

Time to first answer: 21.86 s
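These two figures can be combined into a rough end-to-end latency estimate: the wait for the first token plus the time to stream the remaining output. The sketch below assumes the listed throughput is a steady output rate that applies after the first token arrives, which this page does not confirm. The function name is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               time_to_first_token_s: float = 21.86,
                               tokens_per_second: float = 369.4) -> float:
    """Approximate total response time under a constant-throughput assumption."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return time_to_first_token_s + output_tokens / tokens_per_second


if __name__ == "__main__":
    for n in (100, 1_000, 10_000):
        print(f"{n:>6} output tokens: ~{estimated_response_seconds(n):.2f} s")
```

Under these assumptions the first-token delay dominates for short outputs, since 1,000 tokens add only a few seconds of streaming time.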

Provider price ranking

1 provider

#	Provider	Input	Output
1	IBM (primary)	$0.03	$0.25

Compares prices for this model across API providers.

External links

LLM Stats · Artificial Analysis

Domain	Rank	Score	Source
Coding ranking	467	7.0	AA
Overall ranking	484	15.0	AA
Mathematical reasoning	314	18.0	AA
Reasoning	26	83.0	LS
Science	460	17.0	AA