GLM-4.5-Air

Z AI · GLM · Open Weight · MIT · Commercial OK

Description

GLM-4.5-Air is a more compact variant of GLM-4.5 designed for efficient Agentic, Reasoning, and Coding (ARC) applications. It uses a Mixture-of-Experts (MoE) architecture with 106 billion total parameters, of which 12 billion are active. Like GLM-4.5, it is a hybrid reasoning model: a thinking mode handles complex reasoning and tool usage, and a non-thinking mode gives immediate responses. Despite its compact design, GLM-4.5-Air scores 59.8 across 12 industry-standard benchmarks, ranking 6th overall while remaining efficient. It supports a 128K context length (131,072 tokens, listed as 131K under Context Length) and is released under the MIT open-source license, which allows commercial use.
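To make the parameter counts concrete, the sketch below computes the share of parameters that are active and a rough weights-only memory footprint. The bytes-per-parameter values and the helper names are illustrative assumptions, not figures from this page, and real deployments also need memory for the KV cache and activations.

```python
# Figures taken from the description above.
TOTAL_PARAMS = 106e9   # total parameters
ACTIVE_PARAMS = 12e9   # active parameters (MoE)


def active_fraction(total: float, active: float) -> float:
    """Share of the model's parameters that are active."""
    return active / total


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Weights-only footprint in GB (1 GB = 1e9 bytes); ignores KV cache and activations."""
    return params * bytes_per_param / 1e9


# Hypothetical storage precisions, chosen for illustration only.
PRECISIONS = {"16-bit": 2.0, "8-bit": 1.0, "4-bit": 0.5}

fraction = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
print(f"Active share: {fraction:.1%}")
for name, width in PRECISIONS.items():
    print(f"{name}: ~{weight_memory_gb(TOTAL_PARAMS, width):.0f} GB of weights")
```

Roughly 11% of the weights are active, yet all 106B parameters still have to be stored, so memory follows the total count while compute follows the active count.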

Release Date

2025-07-28

Parameters

106.0B

Context Length

131K

Modalities

text

Capability Radar

Axes: general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Agentic Capability	84	51.0	LS
Code Ranking	157	55.0	AA
General Ranking	238	45.0	AA
Math Reasoning	76	82.0	AA
Science	232	46.0	AA

Benchmark Scores (LLM Stats)

Category	Benchmark	Score
Agents	BFCL-v3	76.4% SR
Agents	Terminal-Bench	30.0% SR
Agents	BrowseComp	21.3% SR
Biology	GPQA	75.0% SR
Biology	SciCode	37.3% SR
Code	LiveCodeBench	70.7% SR
Code	SWE-Bench Verified	57.6% SR
Communication	TAU-bench Retail	77.9% SR
Communication	TAU-bench Airline	60.8% SR
Finance	MMLU-Pro	81.4% SR
General	AA-Index	64.8% SR
Math	MATH-500	98.1% SR
Math	AIME 2024	89.4% SR
Math	Humanity's Last Exam	10.6% SR

AA Evaluation Indices

Index	Score
Math Index	80.7
Intelligence Index	16.5
MATH-500	1.0
MMLU-Pro	0.8
AIME 2025	0.8
GPQA	0.7
LiveCodeBench	0.7
AIME	0.7
Tau2	0.5
LCR	0.4
IFBench	0.4
SciCode	0.3
Terminal-Bench Hard	0.2
HLE	0.1

LLM Stats Category Scores

Categories: Structured Output, Language, Legal, Finance, Healthcare, General, Communication, Tool Calling, Math, Physics, Reasoning, Frontend Development, Biology, Chemistry, Code, Agents, Vision

Pricing

Price Type	Price (per 1M tokens)
Input	$0.17
Output	$0.98
Blended (3:1)	$0.372
Cache Read	$0.03
Cache Write	Free
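The blended figure is consistent with weighting input and output prices 3:1, which the sketch below reproduces. It assumes "3:1" means three input tokens per output token and leaves cache pricing out; the function names are hypothetical.

```python
INPUT_PRICE = 0.17   # USD per 1M input tokens (listed above)
OUTPUT_PRICE = 0.98  # USD per 1M output tokens (listed above)


def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = INPUT_PRICE,
                 output_price: float = OUTPUT_PRICE) -> float:
    """Cost in USD of one request, ignoring cache reads and writes."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Blended (3:1): ${blended:.4f} / 1M tokens")
print(f"30k in + 10k out: ${request_cost(30_000, 10_000):.4f}")
```

The computed value is 0.3725, which the page shows as $0.372. Workloads with a different input-to-output mix would see a different effective price.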

Speed

Metric	Value
Tokens/sec	87.5
Time to First Token	1.61s
Time to Answer	24.48s
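One way to relate the three speed figures is a simple model: total time ≈ time to first token + output tokens / throughput. This is an assumption for illustration only, since the page does not define how Time to Answer is measured; the function names are hypothetical.

```python
TOKENS_PER_SEC = 87.5     # listed output throughput
TTFT_S = 1.61             # listed time to first token, in seconds
TIME_TO_ANSWER_S = 24.48  # listed time to answer, in seconds


def estimated_latency_s(output_tokens: float,
                        ttft_s: float = TTFT_S,
                        tokens_per_sec: float = TOKENS_PER_SEC) -> float:
    """Estimated wall-clock time for a response under the assumed model."""
    return ttft_s + output_tokens / tokens_per_sec


def implied_output_tokens(total_s: float,
                          ttft_s: float = TTFT_S,
                          tokens_per_sec: float = TOKENS_PER_SEC) -> float:
    """Output tokens that would account for a given total time under the same model."""
    return (total_s - ttft_s) * tokens_per_sec


print(f"500-token reply: ~{estimated_latency_s(500):.1f}s")
print(f"Tokens implied by Time to Answer: ~{implied_output_tokens(TIME_TO_ANSWER_S):.0f}")
```

Under this model the listed Time to Answer corresponds to roughly 2,000 generated tokens, which may include reasoning tokens if the measurement was taken in thinking mode.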

Provider Price Ranking

17 providers

Cheapest: submodel · Most Expensive: LLM Gateway

Rank	Provider	Input (per 1M tokens)	Output (per 1M tokens)
1	submodel (Cheapest)	$0.1	$0.5
2	ZenMux	$0.11	$0.56
3	OpenRouter	$0.13	$0.85
4	NovitaAI	$0.13	$0.85
5	Kilo Gateway	$0.13	$0.85
6	SiliconFlow (China)	$0.14	$0.86
7	SiliconFlow	$0.14	$0.86
8	Z AI (PRIMARY)	$0.17	$0.98
9	Z.AI	$0.2	$1.1
10	Vercel AI Gateway	$0.2	$1.1
11	Zhipu AI	$0.2	$1.1
12	OrcaRouter	$0.2	$1.1
13	Merge Gateway	$0.2	$1.1
14	NanoGPT	$0.2006	not listed
15	Cortecs	$0.22	$1.34
16	302.AI	$0.572	$1.714
17	LLM Gateway	$1.1	$4.5

Compare pricing across different API providers for this model.
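As a rough illustration of how much the provider choice matters, the sketch below prices one hypothetical workload (3M input tokens, 1M output tokens) against a subset of the listed providers. It assumes the listed prices are USD per 1M tokens, as in the Pricing section, and skips any provider whose output price is missing.

```python
# (provider, input price, output price); subset of the provider list above.
# Prices are assumed to be USD per 1M tokens.
PROVIDERS = [
    ("submodel", 0.10, 0.50),
    ("ZenMux", 0.11, 0.56),
    ("OpenRouter", 0.13, 0.85),
    ("Z AI", 0.17, 0.98),
    ("NanoGPT", 0.2006, None),  # output price not listed on the page
    ("Cortecs", 0.22, 1.34),
    ("302.AI", 0.572, 1.714),
    ("LLM Gateway", 1.10, 4.50),
]


def workload_cost(input_price: float, output_price: float,
                  input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of a workload at the given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_providers(providers, input_tokens: int, output_tokens: int):
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = [
        (name, workload_cost(inp, out, input_tokens, output_tokens))
        for name, inp, out in providers
        if out is not None  # cannot price a workload without an output price
    ]
    return sorted(costs, key=lambda item: item[1])


# Hypothetical workload: 3M input tokens and 1M output tokens.
ranked = rank_providers(PROVIDERS, 3_000_000, 1_000_000)
for name, cost in ranked:
    print(f"{name:12s} ${cost:.2f}")
```

For this workload the cheapest and most expensive listed providers differ by close to a factor of ten, so the ranking is worth rechecking against the actual input-to-output mix.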

External Sources

LLM Stats · Artificial Analysis