GLM-5.1 (Reasoning)

Z AI · GLM · Open Weight · MIT · Commercial Use

Description

GLM-5.1 is Z.AI's next-generation flagship foundation model designed for long-horizon agentic engineering tasks. Built on a 754B MoE architecture (40B active parameters), it can work continuously and autonomously on a single task for up to 8 hours, completing the full loop from planning and execution to iterative optimization and delivery. GLM-5.1 achieves state-of-the-art on SWE-Bench Pro (58.4) and demonstrates strong performance across coding, reasoning, and agentic benchmarks. It supports 200K context length, 128K max output tokens, thinking mode, function calling, structured output, context caching, and MCP integration. Overall performance is aligned with Claude Opus 4.6 with particular strengths in sustained execution and complex engineering optimization.
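The two parameter counts in the description imply that only a small share of the weights is used for any given token. A minimal sketch of that arithmetic, using the 754B total and 40B active figures quoted above; the function name is illustrative and not part of any Z.AI tooling.

```python
def active_fraction(active_params_b: float, total_params_b: float) -> float:
    """Share of parameters active per token in a mixture-of-experts model."""
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    return active_params_b / total_params_b


# Figures taken from the description: 40B active out of 754B total.
fraction = active_fraction(40.0, 754.0)
print(f"{fraction:.1%} of parameters are active per token")
```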

Release date

2026-04-07

Parameters

754.0B

Context length

200K

Modalities

text

Capability radar

Axes: general, coding, reasoning, science (est.), agents, multimodal.

The science axis uses a reasoning proxy when dedicated science benchmarks are not available.

Rankings

Domain	Rank	Score	Source
Agentic capability	33	60.0	LS
Coding ranking	60	74.0	AA
Overall ranking	21	81.0	AA
Science	43	72.0	AA

Benchmark scores (LLM Stats)

Agents

Vending-Bench 2: 563441.0% (Aut.)
GDPval-AA: 1281.00 / 3000 (Aut.)
BrowseComp: 79.3% (Aut.)
MCP Atlas: 71.8% (Aut.)
TAU3-Bench: 70.6% (Aut.)
Terminal-Bench 2.0: 69.0% (Aut.)
CyberGym: 68.7% (Aut.)
SWE-Bench Pro: 58.4% (Aut.)
Finance Agent v2: 44.8% (Aut.)
NL2Repo: 42.7% (Aut.)
Toolathlon: 40.7% (Aut.)
FrontierSWE: 31.0% (Aut.)

Biology

GPQA: 86.2% (Aut.)

General

LiveBench: 70.2% (Aut.)

Math

AIME 2026: 95.3% (Aut.)
HMMT 2025: 94.0% (Aut.)
IMO-AnswerBench: 83.8% (Aut.)
HMMT Feb 26: 82.6% (Aut.)
Humanity's Last Exam: 52.3% (Aut.)

AA evaluation indices

Coding Index: 55.8
Intelligence Index: 40.2
Tau2: 1.0
GPQA: 0.9
IFBench: 0.8
LCR: 0.6
Terminalbench V2 1: 0.6
SciCode: 0.4
Terminalbench Hard: 0.4
HLE: 0.3
Tau Banking: 0.1

LLM Stats category scores

Legal: 100
Finance: 100
Agents: 100
Reasoning: 100
General: 100

No score listed: Physics, Biology, Chemistry, Math, Safety, Code, Tool Calling, Vision, Coding.

Pricing

Input price: $1.4 / 1M tokens
Output price: $4.4 / 1M tokens
Blended price (3:1): $2.15 / 1M tokens
Cache read price: $0.26 / 1M tokens
Cache write price: Free
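The blended figure can be reproduced from the input and output prices if "3:1" is read as three input tokens for every output token. The page does not define the ratio, so that reading is an assumption, although it does yield the listed $2.15. The sketch below also estimates the cost of a single request under a second assumption: that cached input tokens are billed at the cache read price instead of the input price. Both function names are illustrative.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0,
                 input_price: float = 1.4, output_price: float = 4.4,
                 cache_read_price: float = 0.26) -> float:
    """USD cost of one request; prices are per 1M tokens.

    Assumption: cached_tokens is the part of input_tokens served from cache
    and is billed at cache_read_price rather than input_price.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * input_price
             + cached_tokens * cache_read_price
             + output_tokens * output_price)
    return total / 1_000_000


print(round(blended_price(1.4, 4.4), 2))
print(round(request_cost(100_000, 10_000, cached_tokens=80_000), 4))
```

Under these assumptions, a request with 100,000 input tokens (80,000 of them cached) and 10,000 output tokens costs about $0.09, compared with $0.184 if nothing were cached.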

Speed

Tokens/sec: 99.8
First-token latency: 0.80s
Time to response: 38.80s
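These three figures can be combined into a rough end-to-end latency estimate. The sketch below assumes that "time to response" is the delay before the answer starts (including any thinking phase) and that answer tokens then stream at the listed rate. The page does not define either metric precisely, so treat the result as an approximation; the function name is hypothetical.

```python
def estimated_total_seconds(answer_tokens: int,
                            time_to_response_s: float = 38.80,
                            tokens_per_sec: float = 99.8) -> float:
    """Rough wall-clock time until the full answer has been streamed.

    Assumption: the answer begins after time_to_response_s and is then
    generated at a constant tokens_per_sec.
    """
    if answer_tokens < 0:
        raise ValueError("answer_tokens must be non-negative")
    return time_to_response_s + answer_tokens / tokens_per_sec


print(f"{estimated_total_seconds(1000):.1f}s for a 1000-token answer")
```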

Provider price ranking

25 providers

Cheapest: ZAI. Most expensive: Merge Gateway.

Rank	Provider	Input / Output
1	ZAI (cheapest)	not listed
2	FriendliAI	not listed
3	NanoGPT	$0.3 / $2.55
4	HPC-AI	$0.615 / $2.46
5	ZenMux	$0.8781 / $3.5126
6	Lilac	$0.9 (only one price listed)
7	OpenRouter	$0.98 / $3.08
8	Hugging Face	$3.2 (only one price listed)
9	Wafer	$3.2 (only one price listed)
10	Synthetic	not listed
11	routing.run	not listed
12	Deep Infra	$1.05 / $3.5
13	FastRouter	$1.05 / $3.5
14	Kilo Gateway	$1.26 / $3.96
15	Baseten	$1.3 / $4.3
16	Z AI (primary)	$1.4 / $4.4
17	SiliconFlow (China)	$1.4 / $4.4
18	NovitaAI	$1.4 / $4.4
19	Weights & Biases	$1.4 / $4.4
20	Friendli	$1.4 / $4.4
21	SiliconFlow	$1.4 / $4.4
22	Vercel AI Gateway	$1.4 / $4.4
23	Together AI	$1.4 / $4.4
24	OrcaRouter	$1.4 / $4.4
25	Merge Gateway	$1.4 / $4.4

Compare prices across API providers for this model.
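One way to compare providers on a single number is to apply the same 3:1 blend to each provider's input and output prices. The sketch below does this for a handful of providers from the ranking that list both prices. It assumes the 3:1 ratio means three input tokens per output token, and the helper name is illustrative.

```python
# (input, output) prices copied from the provider ranking; only a subset of
# providers that list both prices is included here.
PROVIDER_PRICES = {
    "NanoGPT": (0.3, 2.55),
    "HPC-AI": (0.615, 2.46),
    "ZenMux": (0.8781, 3.5126),
    "OpenRouter": (0.98, 3.08),
    "Deep Infra": (1.05, 3.5),
    "Z AI": (1.4, 4.4),
}


def blended(input_price: float, output_price: float) -> float:
    """3:1 input-to-output weighted price (assumed meaning of the blend)."""
    return (3 * input_price + output_price) / 4


# Sort providers from cheapest to most expensive blended price.
ranked = sorted(PROVIDER_PRICES, key=lambda name: blended(*PROVIDER_PRICES[name]))

for name in ranked:
    print(f"{name}: ${blended(*PROVIDER_PRICES[name]):.4f}")
```

Note that ordering by blended price can differ from ordering by input price alone: in this subset OpenRouter comes out slightly cheaper than ZenMux once output tokens are weighted in.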

External sources

LLM Stats, Artificial Analysis