Command Palette

Search for a command to run...

Llama 2 Chat 7B

Meta

Open

Context

Release Date
Jul 18, 2023
Knowledge Cutoff
Jul 01, 2023
Window
4,096

PricingPer 1M tokens

Input
$0.05
Output
$0.25
Blended 3:1
$0.1

Capabilities

Speed
131 t/s
Input
Output
Reasoning tokens

Latency

TTFT
0.42 ms
500 token response
4.24 s

Benchmarks

Intelligence
○○○○○
Math
○○○○○
Coding
○○○○○
MMLU Pro
16.4%
GPQA
22.7%
HLE
5.8%
SciCode
0.0%
AIME
0.0%
MATH 500
5.9%
LiveCodeBench
0.2%

Llama 2 Chat 7B