GLM-5.1 (Reasoning)

Z AIGLM开源权重MIT · 商用许可

描述

GLM-5.1 is Z.AI's next-generation flagship foundation model designed for long-horizon agentic engineering tasks. Built on a 754B MoE architecture (40B active parameters), it can work continuously and autonomously on a single task for up to 8 hours, completing the full loop from planning and execution to iterative optimization and delivery. GLM-5.1 achieves state-of-the-art on SWE-Bench Pro (58.4) and demonstrates strong performance across coding, reasoning, and agentic benchmarks. It supports 200K context length, 128K max output tokens, thinking mode, function calling, structured output, context caching, and MCP integration. Overall performance is aligned with Claude Opus 4.6 with particular strengths in sustained execution and complex engineering optimization.

发布日期

2026-04-07

参数规模

754.0B

上下文长度

200K

支持模态

text

能力雷达图

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域	#排名	分数	来源
智能体能力模型榜	33	60.0	LS
代码能力榜	60	74.0	AA
通用能力榜	21	81.0	AA
科学能力	43	72.0	AA

基准测试分数 (LLM Stats)

Agents

Vending-Bench 2

563441.0%自报

GDPval-AA

1281.00 / 3000自报

BrowseComp

79.3%自报

MCP Atlas

71.8%自报

TAU3-Bench

70.6%自报

Terminal-Bench 2.0

69.0%自报

CyberGym

68.7%自报

SWE-Bench Pro

58.4%自报

Finance Agent v2

44.8%自报

NL2Repo

42.7%自报

Toolathlon

40.7%自报

FrontierSWE

31.0%自报

Biology

GPQA

86.2%自报

General

LiveBench

70.2%自报

Math

AIME 2026

95.3%自报

HMMT 2025

94.0%自报

IMO-AnswerBench

83.8%自报

HMMT Feb 26

82.6%自报

Humanity's Last Exam

52.3%自报

AA 评测指数

Coding Index

55.8

Intelligence Index

40.2

Tau2

1.0

Gpqa

0.9

Ifbench

0.8

Lcr

0.6

Terminalbench V2 1

0.6

Scicode

0.4

Terminalbench Hard

0.4

Hle

0.3

Tau Banking

0.1

LLM Stats 分类评分

Legal

100

Finance

100

Agents

100

Reasoning

100

General

100

Physics

Biology

Chemistry

Math

Safety

Code

Tool Calling

Vision

Coding

定价

输入价格$1.4 / 1M tokens

输出价格$4.4 / 1M tokens

混合价格(3:1)$2.15 / 1M tokens

缓存读取价格$0.26 / 1M tokens

缓存写入价格免费

速度

Tokens/秒99.8

首Token延迟0.80s

首回答延迟38.80s

供应商价格排行

25 个供应商

最便宜: ZAI最贵: Merge Gateway

供应商输入输出

1ZAI最便宜

2FriendliAI

3NanoGPT

$0.3

$2.55

4HPC-AI

$0.615

$2.46

5ZenMux

$0.8781

$3.5126

6Lilac

$0.9

7OpenRouter

$0.98

$3.08

8Hugging Face

$3.2

9Wafer

$3.2

10Synthetic

11routing.run

12Deep Infra

$1.05

$3.5

13FastRouter

$1.05

$3.5

14Kilo Gateway

$1.26

$3.96

15Baseten

$1.3

$4.3

16Z AI主要

$1.4

$4.4

17SiliconFlow (China)

$1.4

$4.4

18NovitaAI

$1.4

$4.4

19Weights & Biases

$1.4

$4.4

20Friendli

$1.4

$4.4

21SiliconFlow

$1.4

$4.4

22Vercel AI Gateway

$1.4

$4.4

23Together AI

$1.4

$4.4

24OrcaRouter

$1.4

$4.4

25Merge Gateway

$1.4

$4.4

比较该模型在不同 API 供应商之间的定价。

外部链接

LLM Stats Artificial Analysis