LongCat Flash Lite

LongCatOpen WeightMIT · Usage Commercial

Description

LongCat-Flash-Lite is a lightweight MoE model from Meituan with 68.5B total parameters and only 2.9B-4.5B activated per token. It explores N-gram embedding expansion as a new scaling direction, supporting 256K context length via YaRN. Optimized for agent tooling and programming tasks, achieving 500-700 tokens per second inference speed while maintaining strong performance on coding, math, and agentic benchmarks.

Date de sortie

2026-01-28

Paramètres

68.5B

Longueur du contexte

—

Modalités

text

Radar de capacités

general

coding

reasoning

scienceest.

agents

multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine	#Rang	Score	Source
Capacité agentique	114	34.0	LS
Classement codage	321	26.0	AA
Classement général	231	45.0	AA
Science	283	40.0	AA

Scores de benchmarks (LLM Stats)

Agents

Terminal-Bench

33.8%Aut.

Biology

GPQA

66.8%Aut.

Code

SWE-Bench Verified

54.4%Aut.

SWE-bench Multilingual

38.1%Aut.

Communication

Tau2 Retail

73.1%Aut.

Tau2 Telecom

72.8%Aut.

Tau2 Airline

58.0%Aut.

Finance

MMLU

85.5%Aut.

MMLU-Pro

78.3%Aut.

General

CMMLU

82.5%Aut.

Math

MATH-500

96.8%Aut.

AIME 2024

72.2%Aut.

AIME 2025

63.2%Aut.

Indices d'évaluation AA

Intelligence Index

17.2

Tau2

0.8

Gpqa

0.6

Ifbench

0.4

Scicode

0.3

Lcr

0.3

Terminalbench Hard

0.1

Hle

0.1

Scores par catégorie LLM Stats

Language

Legal

Math

Finance

General

Healthcare

Physics

Reasoning

Biology

Chemistry

Communication

Tool Calling

Frontend Development

Code

Agents

Tarification

Prix d'entréeGratuit

Prix de sortieGratuit

Prix mixte (3:1)Gratuit

Vitesse

Tokens/sec0.0

Délai du premier token0.00s

Temps de réponse0.00s

Classement des Prix par Fournisseur

1 fournisseurs

FournisseurEntréeSortie

1Meituan

Comparer les prix entre différents fournisseurs API pour ce modèle.

Sources externes

LLM Stats Artificial Analysis