Benched.ai
View
About
Command Palette
Search for a command to run...
Models
QwQ 32B-Preview
Alibaba
Reasoning
Open
Context
Release Date
Nov 27, 2024
Window
32,768
Pricing
Per 1M tokens
Input
$0.2
Output
$0.2
Blended 3:1
$0.2
Capabilities
Speed
51 t/s
Input
Output
Reasoning tokens
Latency
TTFT
0.55 ms
500 token response
49.60 s
Benchmarks
Reasoning
●●○○○
Math
●●●○○
Coding
●○○○○
MMLU Pro
64.8%
GPQA
55.7%
HLE
4.8%
SciCode
3.8%
AIME
45.3%
MATH 500
91.0%
LiveCodeBench
33.7%
HumanEval
86.7%
QwQ 32B-Preview