QwQ 32B-Preview

Alibaba

Reasoning · Open

Context

Release Date: Nov 27, 2024
Window: 32,768 tokens

Pricing (per 1M tokens)

Input: $0.2
Output: $0.2
Blended 3:1: $0.2
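
The blended figure can be reproduced with a weighted average. This is a minimal sketch that assumes "Blended 3:1" means three parts input tokens to one part output tokens; the page does not define the ratio, and the function name `blended_price` is hypothetical.

```python
def blended_price(input_price, output_price, input_weight=3.0, output_weight=1.0):
    """Weighted average price per 1M tokens.

    Assumes the 3:1 ratio is input:output; this weighting is an
    assumption, not something the listing states.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Prices taken from the listing above, in USD per 1M tokens.
print(round(blended_price(0.2, 0.2), 4))
```

Because the input and output prices are equal here, any weighting gives the same blended price; the ratio only matters for models that price the two differently.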

Capabilities

Speed: 51 t/s
Input
Output
Reasoning tokens

Latency

TTFT: 0.55 ms
500-token response: 49.60 s

Benchmarks

Reasoning: ●●○○○
Math: ●●●○○
Coding: ○○○○
MMLU Pro: 64.8%
GPQA: 55.7%
HLE: 4.8%
SciCode: 3.8%
AIME: 45.3%
MATH 500: 91.0%
LiveCodeBench: 33.7%
HumanEval: 86.7%
