Benched.ai
Llama 3.2 Instruct 1B
Meta
Open
Context
Release Date: Sep 25, 2024
Knowledge Cutoff: Dec 01, 2023
Window: 128k tokens
Pricing (per 1M tokens)
Input: $0.04
Output: $0.08
Blended (3:1): $0.05
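The blended figure is consistent with a weighted average of the input and output prices at three input tokens per output token. A minimal sketch of that arithmetic follows; the function name `blended_price` and the weighting rule are assumptions made for illustration, not something the page states.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes `ratio` input tokens are consumed for every output token
    (an assumed reading of "Blended 3:1").
    """
    return (ratio * input_price + output_price) / (ratio + 1)


# Listed prices per 1M tokens: $0.04 input, $0.08 output.
print(f"${blended_price(0.04, 0.08):.2f}")  # prints $0.05
```

Under this assumption the listed input and output prices reproduce the listed blended price.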
Capabilities
Speed: 177 t/s
Input
Output
Reasoning tokens
Latency
TTFT: 0.29 s
500 token response: 3.12 s
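The 500-token response time can be roughly cross-checked against the listed speed and time to first token. The sketch below assumes total latency is TTFT plus generation time at a constant rate, and that TTFT is measured in seconds. Both are assumptions, and `response_time` is a hypothetical helper.

```python
def response_time(ttft_s: float, tokens: int, tokens_per_s: float) -> float:
    """Estimated end-to-end latency in seconds.

    Assumes a fixed time to first token followed by generation
    at a constant `tokens_per_s` rate.
    """
    return ttft_s + tokens / tokens_per_s


# Listed values: 177 t/s and a TTFT of 0.29, read here as seconds.
estimate = response_time(0.29, 500, 177)
print(f"{estimate:.2f} s")
```

With these inputs the estimate is about 3.1 s, close to the listed 3.12 s. The small gap presumably reflects rounding in the published figures. The numbers only line up this way if TTFT is read as 0.29 s.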
Benchmarks
Intelligence: ○○○○○
Math: ○○○○○
Coding: ○○○○○
MMLU Pro: 20.0%
GPQA: 19.6%
HLE: 5.3%
SciCode: 1.7%
AIME: 0.0%
MATH 500: 14.0%
LiveCodeBench: 1.9%
HumanEval: 40.2%