Qwen3 8B (Reasoning)

Alibaba

Frontier Model, Reasoning, Open

Context

Release Date
Apr 28, 2025
Window
128k

Pricing (per 1M tokens)

Input
$0.18
Output
$2.1
Blended 3:1
$0.66
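
The blended figure appears to be a weighted average of the input and output prices at a 3:1 input-to-output token ratio. A minimal sketch of that arithmetic follows. The function names `blended_price` and `request_cost` are illustrative and not part of any published API, and the 3:1 weighting formula is an assumption inferred from the label.

```python
# Prices from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.18
OUTPUT_PRICE_PER_M = 2.10


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens (assumed formula: 3 parts input, 1 part output)."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens, output_tokens,
                 input_price=INPUT_PRICE_PER_M, output_price=OUTPUT_PRICE_PER_M):
    """Cost in USD of one request, given token counts and per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # (3 * 0.18 + 1 * 2.10) / 4
    print(round(blended_price(INPUT_PRICE_PER_M, OUTPUT_PRICE_PER_M), 2))  # 0.66
```

Under this assumption the result matches the listed $0.66, which suggests the blended price is derived from the two prices rather than set independently.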

Capabilities

Speed
99 t/s
Input
Output
Reasoning tokens

Latency

TTFT
1.06 s
500 token response
26.39 s
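
The listed response time is much longer than 500 tokens at the listed speed would take on its own. One plausible reading, which is an assumption and not stated in the listing, is that the measurement includes reasoning tokens generated before the visible answer. The sketch below only does the throughput arithmetic. The helper names are hypothetical.

```python
SPEED_TOKENS_PER_S = 99.0   # "Speed" from the listing
RESPONSE_TIME_S = 26.39     # "500 token response" from the listing


def generation_time_s(num_tokens, tokens_per_second):
    """Seconds needed to stream num_tokens at a constant throughput."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


def tokens_in_time(seconds, tokens_per_second):
    """Upper bound on tokens generated in the given time at a constant throughput."""
    return seconds * tokens_per_second


if __name__ == "__main__":
    # Streaming only the 500 answer tokens takes about 5 seconds.
    print(round(generation_time_s(500, SPEED_TOKENS_PER_S), 2))
    # The full listed response time leaves room for roughly 2,600 tokens in total.
    print(round(tokens_in_time(RESPONSE_TIME_S, SPEED_TOKENS_PER_S)))
```

The second figure is an upper bound because it ignores time to first token. It is a rough consistency check, not a measured token count.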

Benchmarks

Reasoning
●●●○○
Math
●●●●
Coding
●●○○○
MMLU Pro
74.3%
GPQA
58.9%
HLE
4.2%
SciCode
22.6%
AIME
74.7%
MATH 500
90.4%
LiveCodeBench
40.6%