K2 Think V2
MBZUAI Institute of Foundation Models
Release Date
2025-12-15
Parameters
—
Context Length
—
Modalities
—
Capability Radar
16
general
23
coding
71
reasoning
46
scienceest.
57
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 268 | 34.0 | AA |
| General Ranking | 281 | 38.0 | AA |
| Science | 203 | 48.0 | AA |
Benchmark Scores (LLM Stats)
No benchmark data available
AA Evaluation Indices
Coding Index21.0
Intelligence Index17.4
Gpqa0.7
Ifbench0.6
Lcr0.5
Scicode0.3
Tau20.3
Terminalbench V2 10.1
Hle0.1
Terminalbench Hard0.1
Tau Banking0.1
LLM Stats Category Scores
No category score data available
Pricing
Input PriceFree
Output PriceFree
Blended Price (3:1)Free
Speed
Tokens/sec0.0
Time to First Token0.00s
Time to Answer0.00s
Provider Price Ranking
Provider Price Ranking
1 providers
ProviderInputOutput
1NanoGPT
$0.17
$0.68
Compare pricing across different API providers for this model.