GLM-4.5 (Reasoning)

Z AI · GLM · Open Weight · MIT · Commercial OK

Description

GLM-4.5 is an Agentic, Reasoning, and Coding (ARC) foundation model designed for intelligent agents. It uses a Mixture-of-Experts (MoE) architecture with 355 billion total parameters, of which 32 billion are active. Trained on 23T tokens through multi-stage training, it is a hybrid reasoning model with two modes: a thinking mode for complex reasoning and tool use, and a non-thinking mode for immediate responses. It supports a 128K context length. It scores 63.2 across 12 industry-standard benchmarks, placing 3rd among all proprietary and open-source models. It is released under the MIT open-source license, which allows commercial use and secondary development.

Release date

2025-07-28

Parameters

355.0B

Context length

131K

Modality

text

Capability radar

[Radar chart; axes: general, coding, reasoning, science (estimated), agents, multimodal]

When no dedicated science benchmark is available, the Science score is estimated using a reasoning proxy.

Benchmark scores (LLM Stats)

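Category	Benchmark	Score	Source
Agents	BFCL-v3	77.8%	Self-reported
Agents	Terminal-Bench	37.5%	Self-reported
Agents	BrowseComp	26.4%	Self-reported
Biology	GPQA	79.1%	Self-reported
Biology	SciCode	41.7%	Self-reported
Code	LiveCodeBench	72.9%	Self-reported
Code	SWE-Bench Verified	64.2%	Self-reported
Communication	TAU-bench Retail	79.7%	Self-reported
Communication	TAU-bench Airline	60.4%	Self-reported
Finance	MMLU-Pro	84.6%	Self-reported
General	AA-Index	67.7%	Self-reported
Math	MATH-500	98.2%	Self-reported
Math	AIME 2024	91.0%	Self-reported
Math	Humanity's Last Exam	14.4%	Self-reported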

AA evaluation indices

Metric	Value
Math Index	73.7
Intelligence Index	26.4
Coding Index	26.3
MATH-500	1.0
AIME	0.9
MMLU-Pro	0.8
GPQA	0.8
LiveCodeBench	0.7
AIME 25	0.7
LCR	0.5
IFBench	0.4
Tau2	0.4
SciCode	0.3
Terminal-Bench Hard	0.2
HLE	0.1

LLM Stats category scores

[Category chart; categories: Structured Output, Finance, General, Healthcare, Language, Legal, Tool Calling, Communication, Math, Biology, Chemistry, Frontend Development, Physics, Reasoning, Agents, Code, Vision]

Pricing

Input price: $0.6 / 1M tokens

Output price: $2.2 / 1M tokens

Blended price (3:1): $1 / 1M tokens
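
The blended figure can be reproduced from the input and output prices. The following is a minimal sketch, assuming the 3:1 ratio means three input tokens for every output token (an assumption that matches the listed numbers, not something the page states). The function names `blended_price` and `request_cost` are illustrative only.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for an input:output token ratio."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = 0.6, output_price: float = 2.2) -> float:
    """Cost in USD of one request; prices are quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Assumed reading of "3:1": 3 parts input tokens to 1 part output tokens.
blended = blended_price(0.6, 2.2)
print(f"Blended price: ${blended:.2f} / 1M tokens")

# Example: a request with 30,000 input tokens and 10,000 output tokens.
print(f"Request cost: ${request_cost(30_000, 10_000):.4f}")
```

Under this reading, (3 × 0.6 + 1 × 2.2) / 4 gives $1 per 1M tokens, which agrees with the blended price listed above.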

Speed

Tokens per second: 42.4 tokens/s

Time to first token: 1.03 s

Time to first answer: 48.20 s
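
The three speed figures can be related to one another with simple arithmetic. The sketch below assumes that the time to first answer equals the time to first token plus the time spent generating reasoning tokens at the listed throughput. That model is an assumption for illustration, not a definition given on this page, and the function names are hypothetical.

```python
def time_to_first_answer(ttft_s: float, tokens_per_s: float,
                         reasoning_tokens: float) -> float:
    """Seconds until the first answer token, under the assumed model."""
    return ttft_s + reasoning_tokens / tokens_per_s


def implied_reasoning_tokens(ttft_s: float, first_answer_s: float,
                             tokens_per_s: float) -> float:
    """Reasoning tokens implied by the gap between the two latencies."""
    return (first_answer_s - ttft_s) * tokens_per_s


# Figures from the Speed section above.
implied = implied_reasoning_tokens(ttft_s=1.03, first_answer_s=48.20,
                                   tokens_per_s=42.4)
print(f"Implied reasoning tokens before the first answer: {implied:.0f}")

# The same model run forward, for a hypothetical 500-token reasoning trace.
print(f"First answer after {time_to_first_answer(1.03, 42.4, 500):.2f} s")
```

With the listed values, the gap of about 47 s at 42.4 tokens/s corresponds to roughly 2,000 reasoning tokens, so the long time to first answer reflects thinking-mode output rather than network latency.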

Available providers

(LS internal units)

No provider data available.

External links

LLM Stats

Rankings

Domain	Rank	Score	Source
Agents & Tools	58	55.0	LS
Code Ranking	125	54.0	AA
General Ranking	187	52.0	AA
Math Reasoning	76	82.0	AA
Science	141	55.0	AA
