BE
← Leaderboard

Llama 4 Scout

Open source
Meta
Open (restricted)
text
vision
Llama 4Released 1y ago
Avg score
55.7
/ 100
Context
10.0M
Output limit
8k
Input price
$0.18 /M
Output price
$0.59 /M

Pricing verified 1y ago · source

Benchmarks

preference

Crowdsourced pairwise human preference rankings of LLM responses. Higher Elo means more frequently preferred by users.

math

AIME 2024High risk
%

American Invitational Mathematics Examination 2024 problems. Three-digit integer answers; very hard for non-reasoning models.

coding

HumanEvalSaturated
% pass@1

164 hand-written Python programming problems scored by passing unit tests. Saturated for frontier models.

vision

MMMUSome risk
%

Massive Multi-discipline Multimodal Understanding; college-exam level questions with images across 30+ subjects.

MathVistaSome risk
%

Math reasoning over visual contexts (charts, figures, geometry).

long context

Long-context retrieval and reasoning suite. We report the 128k token effective-context score.

performance

tok/s

Median sustained output speed in tokens per second on the model's first-party API for medium-length prompts. Higher is faster.

Median time from request to first output chunk in milliseconds on the model's first-party API for medium-length prompts. Lower is snappier; reasoning models are penalised here because they think before talking.

Providers

ProviderInput $/MOutput $/MContextQuant
DeepInfra
deepinfra/fp8
$0.08$0.30328kfp8
Groq
groq
$0.11$0.34131kunknown
Novita
novita/bf16
$0.18$0.59131kbf16
Google
google-vertex
$0.25$0.701.3Munknown
Sourced from OpenRouter. Sorted by lowest output price.

Compare with...