BE
← Leaderboard

Qwen3 235B

Open source
Alibaba (Qwen)
Open license
text
Qwen 3Released 1y ago
Avg score
66.1
/ 100
Context
131k
Output limit
16k
Input price
$0.20 /M
Output price
$0.60 /M

Pricing verified 1y ago · source

Benchmarks

preference

Crowdsourced pairwise human preference rankings of LLM responses. Higher Elo means more frequently preferred by users.

math

American Invitational Mathematics Examination 2024 problems. Three-digit integer answers; very hard for non-reasoning models.

coding

HumanEval% pass@1

164 hand-written Python programming problems scored by passing unit tests. Saturated for frontier models.

Continuously refreshed coding benchmark drawing from LeetCode, AtCoder, and Codeforces; reduces benchmark contamination.

agentic

Real GitHub issues solved end-to-end. Verified subset is a 500-task human-validated slice of SWE-bench.

long context

Long-context retrieval and reasoning suite. We report the 128k token effective-context score.

performance

Median sustained output speed in tokens per second on the model's first-party API for medium-length prompts. Higher is faster.

Median time from request to first output chunk in milliseconds on the model's first-party API for medium-length prompts. Lower is snappier; reasoning models are penalised here because they think before talking.

Providers

ProviderInput $/MOutput $/MContextQuant
DeepInfra
deepinfra/fp8
$0.07$0.10262kfp8
WandB
wandb/bf16
$0.10$0.10262kbf16
Novita
novita/fp8
$0.09$0.58131kfp8
Alibaba
alibaba
$0.15$0.60131kunknown
SiliconFlow
siliconflow/fp8
$0.09$0.60262kfp8
Parasail
parasail/fp8
$0.10$0.60131kfp8
Together
together
$0.20$0.60262kunknown
Friendli
friendli
$0.20$0.80262kunknown
AtlasCloud
atlas-cloud/fp8
$0.20$0.88131kfp8
Google
google-vertex
$0.22$0.88262kunknown
Google
google-vertex
$0.25$1.00262kunknown
Cerebras
cerebras/fp16
$0.60$1.20131kfp16
Sourced from OpenRouter. Sorted by lowest output price.

Compare with...