
Kimi K2.5 (Reasoning)

Kimi
Release date: 2026-01-27
Parameters: (no value listed)
Context length: 262K
Modalities: image, text, video

Capability radar

General: 36
Coding: 49
Reasoning: 88
Science (est.): 63
Agents: 50
Multimodal: 80

The Science score uses a reasoning proxy when dedicated science benchmarks are not available.

Rankings

Domain            Position  Score  Source
Coding ranking    #81       71.0   AA
Overall ranking   #46       76.0   AA
Science           #35       76.0   AA

Benchmark scores (LLM Stats)

Agents

WideSearch: 79.0% (Aut.)
DeepSearchQA: 77.1% (Aut.)
BrowseComp: 74.9% (Aut.)
PaperBench: 63.5% (Aut.)
Terminal-Bench 2.0: 50.8% (Aut.)
SWE-Bench Pro: 50.7% (Aut.)
CyberGym: 41.3% (Aut.)
FrontierSWE: 26.0% (Aut.)

Biology

GPQA: 87.6% (Aut.)
SciCode: 48.7% (Aut.)

Code

SWE-Bench Verified: 76.8% (Aut.)
SWE-bench Multilingual: 73.0% (Aut.)
OJBench (C++): 57.4% (Aut.)

Economics

FinSearchComp T2&T3: 67.8% (Aut.)

Finance

MMLU-Pro: 87.1% (Aut.)

General

LiveCodeBench v6: 85.0% (Aut.)
MMMU-Pro: 78.5% (Aut.)
SimpleVQA: 0.71 / 100 (Aut.)
LiveBench: 69.1% (Aut.)
LongBench v2: 61.0% (Aut.)

Healthcare

VideoMMMU: 86.6% (Aut.)

Image To Text

OCRBench: 92.3% (Aut.)

Long Context

LongVideoBench: 79.8% (Aut.)
LVBench: 75.9% (Aut.)
AA-LCR: 70.0% (Aut.)

Math

AIME 2025: 96.1% (Aut.)
HMMT 2025: 95.4% (Aut.)
MathVista-Mini: 90.1% (Aut.)
MathVision: 84.2% (Aut.)
IMO-AnswerBench: 81.8% (Aut.)
Humanity's Last Exam: 50.2% (Aut.)

Multimodal

InfoVQA (test): 92.6% (Aut.)
OmniDocBench 1.5: 88.8% (Aut.)
Video-MME: 87.4% (Aut.)
MMVU: 80.4% (Aut.)
CharXiv-R: 77.5% (Aut.)
MotionBench: 70.4% (Aut.)
WorldVQA: 46.3% (Aut.)
ZEROBench: 0.11 / 100 (Aut.)

Reasoning

Seal-0: 57.4% (Aut.)

AA evaluation indices

Intelligence Index: 38.1
Tau2: 1.0
GPQA: 0.9
IFBench: 0.7
LCR: 0.7
SciCode: 0.5
TerminalBench Hard: 0.3
HLE: 0.3

LLM Stats category scores

Language: 90
Legal: 90
Finance: 90
Image To Text: 80
Long Context: 80
Math: 80
Multimodal: 80
Frontend Development: 80
Video: 80
Vision: 80
Physics: 70
Reasoning: 70
Search: 70
Structured Output: 70
General: 70
Healthcare: 70
Biology: 70
Chemistry: 70
Agents: 60
Code: 50
Tool Calling: 50
Safety: 40

Pricing

Input price: $0.58 / 1M tokens
Output price: $3 / 1M tokens
Blended price (3:1): $1.185 / 1M tokens
Cache read price: $0.1 / 1M tokens
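
The blended figure is consistent with a weighted average of three parts input to one part output. The sketch below reproduces it under that assumption; the function name `blended_price` and its parameters are illustrative and do not come from the source.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Weighted average price per 1M tokens for an input:output token mix.

    Assumes "3:1" means three input tokens for every output token.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


# Listed prices in USD per 1M tokens: $0.58 input, $3 output.
kimi_blended = blended_price(0.58, 3.0)
print(round(kimi_blended, 3))  # 1.185
```

Changing `input_parts` and `output_parts` gives the blended price for a different traffic mix, for example an output-heavy reasoning workload.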

Speed

Tokens/sec: 47.2
First-token latency: 1.19s
Time to response: 64.11s
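
As a rough way to reason about these figures, generation time can be approximated as first-token latency plus output tokens divided by throughput. This is a simplified model and an assumption here; the page does not say how its time-to-response value is measured, and the function name below is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 47.2,
                               first_token_latency_s: float = 1.19) -> float:
    """Approximate wall-clock time to stream `output_tokens` tokens.

    Simplified model (an assumption, not the page's methodology):
    total = first-token latency + tokens / throughput.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_token_latency_s + output_tokens / tokens_per_second


for n in (500, 2_000, 8_000):
    print(f"{n:>5} tokens: {estimated_response_seconds(n):.1f} s")
```

For a reasoning model, `output_tokens` would presumably need to include any reasoning tokens produced before the visible answer.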

Price ranking by provider


30 providers

Cheapest: NanoGPT
Most expensive: evroc
Rank  Provider                Input ($/1M tokens)  Output ($/1M tokens)
1     NanoGPT (cheapest)      $0.3                 $1.9
2     HPC-AI                  $0.3                 $1.5
3     OpenRouter              $0.375               $2.025
4     Chutes                  $0.44                $2
5     SiliconFlow (China)     $0.45                $2.25
6     Deep Infra              $0.45                $2.25
7     Kilo Gateway            $0.45                $2.2
8     Meganova                $0.45                $2.8
9     SiliconFlow             $0.45                $2.25
10    routing.run             $0.462               $2.42
11    Weights & Biases        $0.5                 $2.85
12    Nebius Token Factory    $0.5                 $2.5
13    Together AI             $0.5                 $2.8
14    Neuralwatt              $0.52                $2.59
15    Synthetic               $0.55                $2.19
16    Venice AI               $0.56                $3.5
17    Kimi (primary)          $0.58                $3
18    ZenMux                  $0.58                $3.02
19    Alibaba (China)         $0.6                 $3
20    Jiekou.AI               $0.6                 $3
21    Hugging Face            $0.6                 $3
22    NovitaAI                $0.6                 $3
23    Cloudflare AI Gateway   $0.6                 $3
24    Amazon Bedrock          $0.6                 $3
25    Baseten                 $0.6                 $3
26    Vercel AI Gateway       $0.6                 $3
27    OrcaRouter              $0.6                 $3
28    Merge Gateway           $0.6                 $3
29    CrofAI                  $1                   $3
30    evroc                   $1.47                $5.9

Compare prices across different API providers for this model.
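
One way to make that comparison concrete is to price a specific workload against each provider's input and output rates. The sketch below uses five rows copied from the provider list; the workload size (3M input tokens, 1M output tokens) and the helper names are illustrative assumptions.

```python
# (input, output) prices in USD per 1M tokens, copied from the provider list.
PROVIDERS = {
    "NanoGPT": (0.3, 1.9),
    "HPC-AI": (0.3, 1.5),
    "OpenRouter": (0.375, 2.025),
    "Kimi": (0.58, 3.0),
    "evroc": (1.47, 5.9),
}


def workload_cost(input_price: float, output_price: float,
                  input_tokens: int, output_tokens: int) -> float:
    """Total USD cost of a workload given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_providers(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        name: workload_cost(inp, out, input_tokens, output_tokens)
        for name, (inp, out) in PROVIDERS.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Hypothetical workload: 3M input tokens and 1M output tokens.
for name, cost in rank_providers(3_000_000, 1_000_000):
    print(f"{name:<12} ${cost:.2f}")
```

The listed ranking appears to be ordered by input price, so the order for a given workload can differ once output tokens are counted. With this mix, HPC-AI comes out below NanoGPT because its output price is lower.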

External sources