Qwen2.5 Coder Instruct 7B

AlibabaQwenOpen WeightApache 2.0 · Commercial OK

Description

Qwen2.5-Coder is a specialized coding model trained on 5.5 trillion tokens of code data, supporting 92 programming languages with a 128K context window. It excels in code generation, completion, and repair while maintaining strong performance in math and general tasks. The model demonstrates exceptional capabilities in multi-programming language tasks and code reasoning.

Release Date

2024-09-19

Parameters

7.0B

Context Length

33K

Modalities

text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	399	14.0	AA
General Ranking	424	23.0	AA
Math Reasoning	250	35.0	AA
Reasoning	58	63.0	LS
Science	418	21.0	AA

Benchmark Scores (LLM Stats)

Code

HumanEval

88.4%SR

Aider

55.6%SR

LiveCodeBench

18.2%SR

Finance

MMLU-Base

68.0%SR

MMLU

67.6%SR

TruthfulQA

50.6%SR

MMLU-Pro

40.1%SR

TheoremQA

34.0%SR

General

MBPP

0.83 / 100SR

MMLU-Redux

66.6%SR

ARC-C

60.9%SR

BigCodeBench

41.0%SR

Language

Winogrande

72.9%SR

Math

GSM8k

83.9%SR

MATH

46.6%SR

STEM

34.0%SR

Reasoning

HellaSwag

76.8%SR

CRUXEval-Input-CoT

56.5%SR

CRUXEval-Output-CoT

56.0%SR

AA Evaluation Indices

Intelligence Index

10.0

Math 500

0.7

Mmlu Pro

0.5

Gpqa

0.3

Scicode

0.1

Livecodebench

0.1

Aime

0.1

Hle

0.0

LLM Stats Category Scores

General

Language

Math

Reasoning

Code

Finance

Healthcare

Legal

Physics

Pricing

Input PriceFree

Output PriceFree

Blended Price (3:1)Free

Speed

Tokens/sec0.0 tokens/s

Time to First Token0.00s

Time to Answer0.00s

Available Providers

(LS internal units)

No provider data available

External Sources

LLM Stats