Saltar al contenido principal

Kimi K2.5 (Non-reasoning)

KimiKimiOpen WeightMIT · Uso Comercial

Descripción

Kimi K2.5 is Moonshot AI's flagship agentic model and a new SOTA open model. It unifies vision and text, thinking and non-thinking modes, and single-agent and multi-agent execution into one model. Built with Full-Parameter RL tuning, it achieves state-of-the-art performance across agents, coding, image, and video benchmarks.

Fecha de lanzamiento
2026-01-27
Parámetros
1.0T
Longitud del contexto
262K
Modalidades
image, text, video

Radar de capacidades

26
general
40
coding
79
reasoning
52
scienceest.
50
agents
80
multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio#PosiciónPuntuaciónFuente
Capacidad agéntica42
59.0
LS
Ranking de codificación168
54.0
AA
Ranking general157
56.0
AA
Ranking multimodal66
71.0
LS
Razonamiento72
57.0
LS
Ciencia139
56.0
AA

Puntuaciones de benchmarks (LLM Stats)

Agents

WideSearch79.0%Aut.
DeepSearchQA77.1%Aut.
BrowseComp74.9%Aut.
PaperBench63.5%Aut.
Terminal-Bench 2.050.8%Aut.
SWE-Bench Pro50.7%Aut.
CyberGym41.3%Aut.
FrontierSWE26.0%Aut.

Biology

GPQA87.6%Aut.
SciCode48.7%Aut.

Code

SWE-Bench Verified76.8%Aut.
SWE-bench Multilingual73.0%Aut.
OJBench (C++)57.4%Aut.

Economics

FinSearchComp T2&T367.8%Aut.

Finance

MMLU-Pro87.1%Aut.

General

LiveCodeBench v685.0%Aut.
MMMU-Pro78.5%Aut.
SimpleVQA0.71 / 100Aut.
LiveBench69.1%Aut.
LongBench v261.0%Aut.

Healthcare

VideoMMMU86.6%Aut.

Image To Text

OCRBench92.3%Aut.

Long Context

LongVideoBench79.8%Aut.
LVBench75.9%Aut.
AA-LCR70.0%Aut.

Math

AIME 202596.1%Aut.
HMMT 202595.4%Aut.
MathVista-Mini90.1%Aut.
MathVision84.2%Aut.
IMO-AnswerBench81.8%Aut.
Humanity's Last Exam50.2%Aut.

Multimodal

InfoVQAtest92.6%Aut.
OmniDocBench 1.588.8%Aut.
Video-MME87.4%Aut.
MMVU80.4%Aut.
CharXiv-R77.5%Aut.
MotionBench70.4%Aut.
WorldVQA46.3%Aut.
ZEROBench0.11 / 100Aut.

Reasoning

Seal-057.4%Aut.

Índices de evaluación AA

Intelligence Index
29.4
Tau2
0.8
Gpqa
0.8
Lcr
0.6
Ifbench
0.4
Scicode
0.4
Terminalbench Hard
0.2
Hle
0.1

Puntuaciones por categoría LLM Stats

Language
90
Legal
90
Finance
90
Image To Text
80
Long Context
80
Math
80
Multimodal
80
Frontend Development
80
Video
80
Vision
80
Physics
70
Reasoning
70
Search
70
Structured Output
70
General
70
Healthcare
70
Biology
70
Chemistry
70
Agents
60
Code
50
Tool Calling
50
Safety
40

Precios

Precio de entrada$0.6 / 1M tokens
Precio de salida$3 / 1M tokens
Precio mixto (3:1)$1.2 / 1M tokens
Precio de lectura caché$0.1 / 1M tokens

Velocidad

Tokens/seg37.8
Retraso del primer token1.25s
Tiempo hasta la respuesta1.25s

Ranking de Precios por Proveedor

Ranking de Precios por Proveedor

17 proveedores

Más barato: NanoGPTMás caro: Moonshot AI
ProveedorEntradaSalida
1NanoGPTMás barato
$0.3
$1.9
2CrofAI
$0.35
$1.7
3DigitalOcean
$0.5
$2.7
4Auriko
$0.5
$2.8
5Cortecs
$0.55
$2.76
6Alibaba (China)
$0.574
$2.411
7KimiPRINCIPAL
$0.6
$3
8Abacus
$0.6
$3
9OpenCode Go
$0.6
$3
10OpenCode Zen
$0.6
$3
11FrogBot
$0.6
$3
12AIHubMix
$0.6
$3
13Moonshot AI (China)
$0.6
$3
14Azure Cognitive Services
$0.6
$3
15LLM Gateway
$0.6
$3
16Azure
$0.6
$3
17Moonshot AI
$0.6
$3

Comparar precios entre diferentes proveedores de API para este modelo.

Fuentes externas