Benched.ai
View
About
Command Palette
Search for a command to run...
Models
Llama 2 Chat 7B
Meta
Open
Context
Release Date
Jul 18, 2023
Knowledge Cutoff
Jul 01, 2023
Window
4,096
Pricing
Per 1M tokens
Input
$0.05
Output
$0.25
Blended 3:1
$0.1
Capabilities
Speed
131 t/s
Input
Output
Reasoning tokens
Latency
TTFT
0.42 ms
500 token response
4.24 s
Benchmarks
Intelligence
○○○○○
Math
○○○○○
Coding
○○○○○
MMLU Pro
16.4%
GPQA
22.7%
HLE
5.8%
SciCode
0.0%
AIME
0.0%
MATH 500
5.9%
LiveCodeBench
0.2%
Llama 2 Chat 7B