Llama 3.2 Instruct 11B (Vision)

MetaLlama오픈 웨이트Llama 3.2 Community License

설명

Llama 3.2 11B Vision Instruct is an instruction-tuned multimodal large language model optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. It accepts text and images as input and generates text as output.

출시일

2024-09-25

파라미터

10.6B

컨텍스트 길이

131K

모달리티

image, text

능력 레이더

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
코딩 랭킹	453	9.0	AA
종합 랭킹	451	20.0	AA
수학 추론	328	13.0	AA
멀티모달 랭킹	26	84.0	LS
과학	474	14.0	AA

벤치마크 점수 (LLM Stats)

Biology

GPQA

32.8%자체 보고

Finance

MMLU

73.0%자체 보고

General

MMMU

50.7%자체 보고

MMMU-Pro

33.0%자체 보고

Image To Text

DocVQA

88.4%자체 보고

VQAv2 (test)

75.2%자체 보고

Math

MGSM

68.9%자체 보고

MATH

51.9%자체 보고

MathVista

51.5%자체 보고

Multimodal

AI2D

91.1%자체 보고

ChartQA

83.4%자체 보고

AA 평가 지수

Intelligence Index

3.3

Math Index

1.7

Math 500

0.5

Mmlu Pro

0.5

Ifbench

0.3

Gpqa

0.2

Tau2

0.1

Lcr

0.1

Scicode

0.1

Livecodebench

0.1

Aime

0.1

Hle

0.1

Aime 25

0.0

Terminalbench Hard

0.0

LLM Stats 카테고리 점수

Image To Text

Language

Legal

Multimodal

Finance

Vision

Math

Reasoning

Healthcare

General

Physics

Biology

Chemistry

가격

입력 가격$0.245 / 1M 토큰

출력 가격$0.245 / 1M 토큰

혼합 가격 (3:1)$0.245 / 1M 토큰

속도

토큰/초85.7

첫 토큰 지연0.55s

첫 응답 지연0.55s

공급자 가격 순위

10개 공급자

최저가: Cloudflare Workers AI최고가: Azure

공급자입력출력

1Cloudflare Workers AI최저가

$0.0485

$0.676

2Kilo Gateway

$0.049

3Cloudflare AI Gateway

$0.049

$0.68

4Inference

$0.055

5LLM Gateway

$0.07

$0.33

6Vercel AI Gateway

$0.16

7Meta주요

$0.245

8OpenRouter

$0.345

9Azure Cognitive Services

$0.37

10Azure

$0.37

이 모델의 다양한 API 공급자 간 가격 비교.

외부 링크

LLM Stats Artificial Analysis