Llama 2 Chat 7B

Meta

Open

Context

Release Date: Jul 18, 2023
Knowledge Cutoff: Jul 01, 2023
Window: 4,096

PricingPer 1M tokens

Input: $0.05
Output: $0.25
Blended 3:1: $0.1

Capabilities

Speed: 131 t/s
Input
Output
Reasoning tokens

Latency

TTFT: 0.42 ms
500 token response: 4.24 s

Benchmarks

Intelligence: ○○○○○
Math: ○○○○○
Coding: ○○○○○
MMLU Pro: 16.4%
GPQA: 22.7%
HLE: 5.8%
SciCode: 0.0%
AIME: 0.0%
MATH 500: 5.9%
LiveCodeBench: 0.2%

Llama 2 Chat 7B