GLM-5.1 (Reasoning)

Z AIGLMOpen WeightMIT · Commercial OK

説明

GLM-5.1 is Z.AI's next-generation flagship foundation model designed for long-horizon agentic engineering tasks. Built on a 754B MoE architecture (40B active parameters), it can work continuously and autonomously on a single task for up to 8 hours, completing the full loop from planning and execution to iterative optimization and delivery. GLM-5.1 achieves state-of-the-art on SWE-Bench Pro (58.4) and demonstrates strong performance across coding, reasoning, and agentic benchmarks. It supports 200K context length, 128K max output tokens, thinking mode, function calling, structured output, context caching, and MCP integration. Overall performance is aligned with Claude Opus 4.6 with particular strengths in sustained execution and complex engineering optimization.

リリース日

2026-04-07

パラメータ

754.0B

コンテキスト長

203K

モダリティ

text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Agents

Vending-Bench 2

563441.0%自己申告

BrowseComp

79.3%自己申告

MCP Atlas

71.8%自己申告

TAU3-Bench

70.6%自己申告

Terminal-Bench 2.0

69.0%自己申告

CyberGym

68.7%自己申告

SWE-Bench Pro

58.4%自己申告

NL2Repo

42.7%自己申告

Toolathlon

40.7%自己申告

Biology

GPQA

86.2%自己申告

Math

AIME 2026

95.3%自己申告

HMMT 2025

94.0%自己申告

IMO-AnswerBench

83.8%自己申告

HMMT Feb 26

82.6%自己申告

Humanity's Last Exam

52.3%自己申告

AA評価指数

Intelligence Index

51.4

Coding Index

43.4

Tau2

1.0

Gpqa

0.9

Ifbench

0.8

Lcr

0.6

Scicode

0.4

Terminalbench Hard

0.4

Hle

0.3

LLM Statsカテゴリスコア

Agents

100

Reasoning

100

Biology

Chemistry

General

Physics

Math

Code

Safety

Tool Calling

Vision

Coding

価格設定

入力価格$1.4 / 1M tokens

出力価格$4.4 / 1M tokens

混合価格（3:1）$2.15 / 1M tokens

速度

トークン/秒53.8 tokens/s

初トークン遅延1.04s

初回答遅延71.55s

利用可能なプロバイダー

(LS内部単位)

プロバイダー	入力価格	出力価格
ZAI	1.4M	4.4M

外部リンク

LLM Stats

ドメイン	#順位	スコア	ソース
Agents & Tools	21	67.0	LS
Code Ranking	40	75.0	AA
General Ranking	9	90.0	AA
Science	33	76.0	AA

説明

能力レーダー

ランキング

ベンチマークスコア (LLM Stats)

Agents

Biology

Math

AA評価指数

LLM Statsカテゴリスコア

価格設定

速度

利用可能なプロバイダー

外部リンク