Llama 3.1 Instruct 8B

Meta

Open

Context

Release Date: Jul 23, 2024
Knowledge Cutoff: Dec 01, 2023
Window: 128k

PricingPer 1M tokens

Input: $0.1
Output: $0.1
Blended 3:1: $0.1

Capabilities

Speed: 215 t/s
Input
Output
Reasoning tokens

Latency

TTFT: 0.29 ms
500 token response: 2.61 s

Benchmarks

Intelligence: ●○○○○
Math: ●○○○○
Coding: ●○○○○
MMLU Pro: 47.6%
GPQA: 25.9%
HLE: 5.1%
SciCode: 13.2%
AIME: 7.7%
MATH 500: 51.9%
LiveCodeBench: 11.6%
HumanEval: 66.5%

Llama 3.1 Instruct 8B