Benched.ai
View
About
Command Palette
Search for a command to run...
Models
Llama 3.1 Instruct 8B
Meta
Open
Context
Release Date
Jul 23, 2024
Knowledge Cutoff
Dec 01, 2023
Window
128k
Pricing
Per 1M tokens
Input
$0.1
Output
$0.1
Blended 3:1
$0.1
Capabilities
Speed
215 t/s
Input
Output
Reasoning tokens
Latency
TTFT
0.29 ms
500 token response
2.61 s
Benchmarks
Intelligence
●○○○○
Math
●○○○○
Coding
●○○○○
MMLU Pro
47.6%
GPQA
25.9%
HLE
5.1%
SciCode
13.2%
AIME
7.7%
MATH 500
51.9%
LiveCodeBench
11.6%
HumanEval
66.5%
Llama 3.1 Instruct 8B