Qwen2.5 Coder Instruct 7B

AlibabaQwenOpen WeightApache 2.0 · Commercial OK

Description

Qwen2.5-Coder is a specialized coding model trained on 5.5 trillion tokens of code data, supporting 92 programming languages with a 128K context window. It excels in code generation, completion, and repair while maintaining strong performance in math and general tasks. The model demonstrates exceptional capabilities in multi-programming language tasks and code reasoning.

Date de sortie

2024-09-19

Paramètres

7.0B

Longueur du contexte

33K

Modalités

text

Radar de capacités

general

coding

reasoning

scienceest.

agents

multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine	#Rang	Score	Source
Code Ranking	399	14.0	AA
General Ranking	424	23.0	AA
Math Reasoning	250	35.0	AA
Reasoning	58	63.0	LS
Science	418	21.0	AA

Scores de benchmarks (LLM Stats)

Code

HumanEval

88.4%Aut.

Aider

55.6%Aut.

LiveCodeBench

18.2%Aut.

Finance

MMLU-Base

68.0%Aut.

MMLU

67.6%Aut.

TruthfulQA

50.6%Aut.

MMLU-Pro

40.1%Aut.

TheoremQA

34.0%Aut.

General

MBPP

0.83 / 100Aut.

MMLU-Redux

66.6%Aut.

ARC-C

60.9%Aut.

BigCodeBench

41.0%Aut.

Language

Winogrande

72.9%Aut.

Math

GSM8k

83.9%Aut.

MATH

46.6%Aut.

STEM

34.0%Aut.

Reasoning

HellaSwag

76.8%Aut.

CRUXEval-Input-CoT

56.5%Aut.

CRUXEval-Output-CoT

56.0%Aut.

Indices d'évaluation AA

Intelligence Index

10.0

Math 500

0.7

Mmlu Pro

0.5

Gpqa

0.3

Scicode

0.1

Livecodebench

0.1

Aime

0.1

Hle

0.0

Scores par catégorie LLM Stats

General

Language

Math

Reasoning

Code

Finance

Healthcare

Legal

Physics

Tarification

Prix d'entréeGratuit

Prix de sortieGratuit

Prix mixte (3:1)Gratuit

Vitesse

Tokens/sec0.0 tokens/s

Délai du premier token0.00s

Temps de réponse0.00s

Fournisseurs disponibles

(Unités internes LS)

Aucune donnée de fournisseur disponible

Sources externes

LLM Stats