QwQ 32B-Preview

Alibaba

ReasoningOpen

Context

Release Date: Nov 27, 2024
Window: 32,768

PricingPer 1M tokens

Input: $0.2
Output: $0.2
Blended 3:1: $0.2

Capabilities

Speed: 51 t/s
Input
Output
Reasoning tokens

Latency

TTFT: 0.55 ms
500 token response: 49.60 s

Benchmarks

Reasoning: ●●○○○
Math: ●●●○○
Coding: ●○○○○
MMLU Pro: 64.8%
GPQA: 55.7%
HLE: 4.8%
SciCode: 3.8%
AIME: 45.3%
MATH 500: 91.0%
LiveCodeBench: 33.7%
HumanEval: 86.7%

QwQ 32B-Preview