LongCat-Flash-Chat

Meituanओपन वेटMIT · व्यावसायिक उपयोग

विवरण

LongCat-Flash-Chat is Meituan's first open-source foundation model, a 560B parameter Mixture-of-Experts (MoE) model that dynamically activates 18.6B-31.3B parameters (~27B average) based on contextual demands. It features Zero-Computation Experts for efficient routing and supports 128K context. Optimized for conversational and agentic tasks, it shows competitive performance across reasoning, coding, instruction following, and domain benchmarks with particular strengths in tool use and complex multi-step interactions. Achieves over 100 tokens per second on H800 GPUs.

रिलीज़ तिथि

2025-08-29

पैरामीटर

560.0B

संदर्भ लंबाई

—

मोडैलिटीज़

text

क्षमता रडार

general

coding

reasoning

scienceअनुमानित

agents

multimodal

समर्पित विज्ञान बेंचमार्क उपलब्ध न होने पर Science तर्क प्रॉक्सी का उपयोग करके अनुमान लगाता है।

रैंकिंग

डोमेन	#रैंक	स्कोर	स्रोत
एजेंटिक क्षमता	103	40.0	LS
तर्क	11	89.0	LS

बेंचमार्क स्कोर (LLM Stats)

Agents

Terminal-Bench

39.5%स्वयं

Biology

GPQA

73.2%स्वयं

Code

HumanEval

88.4%स्वयं

SWE-Bench Verified

60.4%स्वयं

LiveCodeBench

48.0%स्वयं

Communication

Tau2 Telecom

73.7%स्वयं

Tau2 Retail

71.3%स्वयं

Tau2 Airline

58.0%स्वयं

Finance

MMLU

89.7%स्वयं

MMLU-Pro

82.7%स्वयं

General

IFEval

89.6%स्वयं

CMMLU

84.3%स्वयं

Math

MATH-500

96.4%स्वयं

DROP

79.1%स्वयं

AIME 2025

61.3%स्वयं

Reasoning

ZebraLogic

89.3%स्वयं

AA मूल्यांकन सूचकांक

कोई AA मूल्यांकन डेटा उपलब्ध नहीं

LLM Stats श्रेणी स्कोर

Instruction Following

Language

Legal

Structured Output

Finance

Healthcare

Math

General

Physics

Reasoning

Biology

Chemistry

Communication

Tool Calling

Frontend Development

Code

Agents

मूल्य निर्धारण

कोई मूल्य डेटा उपलब्ध नहीं

गति

कोई गति डेटा उपलब्ध नहीं

प्रदाता मूल्य रैंकिंग

कोई प्रदाता डेटा उपलब्ध नहीं

बाहरी लिंक

LLM Stats Artificial Analysis