K2 Think V2

MBZUAI Institute of Foundation Models

Release Date

2025-12-15

Parameters

—

Context Length

—

Modalities

—

Capability Radar

Axes: general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	Rank	Score	Source
Code Ranking	268	34.0	AA
General Ranking	281	38.0	AA
Science	203	48.0	AA

Benchmark Scores (LLM Stats)

No benchmark data available

AA Evaluation Indices

Coding Index

21.0

Intelligence Index

17.4

GPQA

0.7

IFBench

0.6

LCR

0.5

SciCode

0.3

Tau2

0.3

Terminalbench V2 1

0.1

HLE

0.1

Terminalbench Hard

0.1

Tau Banking

0.1

LLM Stats Category Scores

No category score data available

Pricing

Input Price

Free

Output Price

Free

Blended Price (3:1)

Free

Speed

Tokens/sec

0.0

Time to First Token

0.00s

Time to Answer

0.00s

Provider Price Ranking

1 provider

#	Provider	Input	Output
1	NanoGPT	$0.17	$0.68

Compare pricing across different API providers for this model.
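
The "Blended Price (3:1)" figure is usually read as a weighted average of three parts input price to one part output price. The sketch below applies that weighting to the NanoGPT prices listed above. The function name `blended_price` and the 3:1 input-to-output interpretation are assumptions, not something this page states, and the page does not give the pricing unit, so the result is in the same unit as the listed prices.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    Assumption: "3:1" means three parts input to one part output.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# NanoGPT prices as listed in the provider table (unit not stated on the page).
nanogpt_blended = blended_price(0.17, 0.68)
print(f"Blended (3:1): ${nanogpt_blended:.4f}")
```

Under this reading, a provider with a low input price and a higher output price lands closer to the input price, since input is weighted three times as heavily.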

External Sources

Artificial Analysis