Benched.ai
View
About
Command Palette
Search for a command to run...
Models
Llama 3.1 Instruct 70B
Meta
Open
Context
Release Date
Jul 23, 2024
Knowledge Cutoff
Dec 01, 2023
Window
128k
Pricing
Per 1M tokens
Input
$0.72
Output
$0.72
Blended 3:1
$0.72
Capabilities
Speed
60 t/s
Input
Output
Reasoning tokens
Latency
TTFT
0.43 ms
500 token response
8.76 s
Benchmarks
Intelligence
●●○○○
Math
●●○○○
Coding
●○○○○
MMLU Pro
67.6%
GPQA
40.9%
HLE
4.6%
SciCode
26.7%
AIME
17.3%
MATH 500
64.9%
LiveCodeBench
23.2%
HumanEval
81.2%
Llama 3.1 Instruct 70B