Skip to main content

GLM-4.5-Air

Z AIGLMOpen WeightMIT · Commercial OK

Description

GLM-4.5-Air is a more compact variant of GLM-4.5 designed for efficient Agentic, Reasoning, and Coding (ARC) applications. It features 106 billion total parameters with 12 billion active parameters using MoE architecture. Like GLM-4.5, it is a hybrid reasoning model providing thinking mode for complex reasoning and tool usage, and non-thinking mode for immediate responses. Despite its compact design, GLM-4.5-Air delivers competitive performance with a score of 59.8 across 12 industry-standard benchmarks, ranking 6th overall while maintaining superior efficiency. It supports 128K context length and is released under MIT open-source license allowing commercial use.

Release Date
2025-07-28
Parameters
106.0B
Context Length
131K
Modalities
text

Capability Radar

35
general
60
coding
79
reasoning
45
scienceest.
70
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Agentic Capability84
51.0
LS
Code Ranking157
55.0
AA
General Ranking238
45.0
AA
Math Reasoning76
82.0
AA
Science232
46.0
AA

Benchmark Scores (LLM Stats)

Agents

BFCL-v376.4%SR
Terminal-Bench30.0%SR
BrowseComp21.3%SR

Biology

GPQA75.0%SR
SciCode37.3%SR

Code

LiveCodeBench70.7%SR
SWE-Bench Verified57.6%SR

Communication

TAU-bench Retail77.9%SR
TAU-bench Airline60.8%SR

Finance

MMLU-Pro81.4%SR

General

AA-Index64.8%SR

Math

MATH-50098.1%SR
AIME 202489.4%SR
Humanity's Last Exam10.6%SR

AA Evaluation Indices

Math Index
80.7
Intelligence Index
16.5
Math 500
1.0
Mmlu Pro
0.8
Aime 25
0.8
Gpqa
0.7
Livecodebench
0.7
Aime
0.7
Tau2
0.5
Lcr
0.4
Ifbench
0.4
Scicode
0.3
Terminalbench Hard
0.2
Hle
0.1

LLM Stats Category Scores

Structured Output
80
Language
80
Legal
80
Finance
80
Healthcare
80
General
70
Communication
70
Tool Calling
70
Math
60
Physics
60
Reasoning
60
Frontend Development
60
Biology
60
Chemistry
60
Code
50
Agents
40
Search
20
Vision
10

Pricing

Input Price$0.17 / 1M tokens
Output Price$0.98 / 1M tokens
Blended Price (3:1)$0.372 / 1M tokens
Cache Read Price$0.03 / 1M tokens
Cache Write PriceFree

Speed

Tokens/sec87.5
Time to First Token1.61s
Time to Answer24.48s

Provider Price Ranking

Provider Price Ranking

17 providers

Cheapest: submodelMost Expensive: LLM Gateway
ProviderInputOutput
1submodelCheapest
$0.1
$0.5
2ZenMux
$0.11
$0.56
3OpenRouter
$0.13
$0.85
4NovitaAI
$0.13
$0.85
5Kilo Gateway
$0.13
$0.85
6SiliconFlow (China)
$0.14
$0.86
7SiliconFlow
$0.14
$0.86
8Z AIPRIMARY
$0.17
$0.98
9Z.AI
$0.2
$1.1
10Vercel AI Gateway
$0.2
$1.1
11Zhipu AI
$0.2
$1.1
12OrcaRouter
$0.2
$1.1
13Merge Gateway
$0.2
$1.1
14NanoGPT
$0.2006
$0.2006
15Cortecs
$0.22
$1.34
16302.AI
$0.572
$1.714
17LLM Gateway
$1.1
$4.5

Compare pricing across different API providers for this model.

External Sources