Skip to main content

Kimi K2.5 (Non-reasoning)

KimiKimiOpen WeightMIT · Commercial OK

Description

Kimi K2.5 is Moonshot AI's flagship agentic model and a new SOTA open model. It unifies vision and text, thinking and non-thinking modes, and single-agent and multi-agent execution into one model. Built with Full-Parameter RL tuning, it achieves state-of-the-art performance across agents, coding, image, and video benchmarks.

Release Date
2026-01-27
Parameters
1.0T
Context Length
262K
Modalities
image, text, video

Capability Radar

26
general
40
coding
79
reasoning
52
scienceest.
50
agents
80
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Agentic Capability42
59.0
LS
Code Ranking168
54.0
AA
General Ranking157
56.0
AA
Multimodal Ranking66
71.0
LS
Reasoning72
57.0
LS
Science139
56.0
AA

Benchmark Scores (LLM Stats)

Agents

WideSearch79.0%SR
DeepSearchQA77.1%SR
BrowseComp74.9%SR
PaperBench63.5%SR
Terminal-Bench 2.050.8%SR
SWE-Bench Pro50.7%SR
CyberGym41.3%SR
FrontierSWE26.0%SR

Biology

GPQA87.6%SR
SciCode48.7%SR

Code

SWE-Bench Verified76.8%SR
SWE-bench Multilingual73.0%SR
OJBench (C++)57.4%SR

Economics

FinSearchComp T2&T367.8%SR

Finance

MMLU-Pro87.1%SR

General

LiveCodeBench v685.0%SR
MMMU-Pro78.5%SR
SimpleVQA0.71 / 100SR
LiveBench69.1%SR
LongBench v261.0%SR

Healthcare

VideoMMMU86.6%SR

Image To Text

OCRBench92.3%SR

Long Context

LongVideoBench79.8%SR
LVBench75.9%SR
AA-LCR70.0%SR

Math

AIME 202596.1%SR
HMMT 202595.4%SR
MathVista-Mini90.1%SR
MathVision84.2%SR
IMO-AnswerBench81.8%SR
Humanity's Last Exam50.2%SR

Multimodal

InfoVQAtest92.6%SR
OmniDocBench 1.588.8%SR
Video-MME87.4%SR
MMVU80.4%SR
CharXiv-R77.5%SR
MotionBench70.4%SR
WorldVQA46.3%SR
ZEROBench0.11 / 100SR

Reasoning

Seal-057.4%SR

AA Evaluation Indices

Intelligence Index
29.4
Tau2
0.8
Gpqa
0.8
Lcr
0.6
Ifbench
0.4
Scicode
0.4
Terminalbench Hard
0.2
Hle
0.1

LLM Stats Category Scores

Language
90
Legal
90
Finance
90
Image To Text
80
Long Context
80
Math
80
Multimodal
80
Frontend Development
80
Video
80
Vision
80
Physics
70
Reasoning
70
Search
70
Structured Output
70
General
70
Healthcare
70
Biology
70
Chemistry
70
Agents
60
Code
50
Tool Calling
50
Safety
40

Pricing

Input Price$0.6 / 1M tokens
Output Price$3 / 1M tokens
Blended Price (3:1)$1.2 / 1M tokens
Cache Read Price$0.1 / 1M tokens

Speed

Tokens/sec37.8
Time to First Token1.25s
Time to Answer1.25s

Provider Price Ranking

Provider Price Ranking

17 providers

Cheapest: NanoGPTMost Expensive: Moonshot AI
ProviderInputOutput
1NanoGPTCheapest
$0.3
$1.9
2CrofAI
$0.35
$1.7
3DigitalOcean
$0.5
$2.7
4Auriko
$0.5
$2.8
5Cortecs
$0.55
$2.76
6Alibaba (China)
$0.574
$2.411
7KimiPRIMARY
$0.6
$3
8Abacus
$0.6
$3
9OpenCode Go
$0.6
$3
10OpenCode Zen
$0.6
$3
11FrogBot
$0.6
$3
12AIHubMix
$0.6
$3
13Moonshot AI (China)
$0.6
$3
14Azure Cognitive Services
$0.6
$3
15LLM Gateway
$0.6
$3
16Azure
$0.6
$3
17Moonshot AI
$0.6
$3

Compare pricing across different API providers for this model.

External Sources