Command Palette

Search for a command to run...

Qwen3 4B

Alibaba

Open

Context

Release Date
Apr 28, 2025
Window
32,000

PricingPer 1M tokens

Input
$0.11
Output
$0.42
Blended 3:1
$0.1875

Capabilities

Speed
106 t/s
Input
Output
Reasoning tokens

Latency

TTFT
1.05 ms
500 token response
5.75 s

Benchmarks

Intelligence
●●○○○
Math
●●●○○
Coding
○○○○
MMLU Pro
58.6%
GPQA
39.8%
HLE
3.7%
SciCode
16.7%
AIME
21.3%
MATH 500
84.3%
LiveCodeBench
23.3%