Llama 3.1 Instruct 70B

Meta

Open

Context

Release Date: Jul 23, 2024
Knowledge Cutoff: Dec 01, 2023
Window: 128k

PricingPer 1M tokens

Input: $0.72
Output: $0.72
Blended 3:1: $0.72

Capabilities

Speed: 60 t/s
Input
Output
Reasoning tokens

Latency

TTFT: 0.43 ms
500 token response: 8.76 s

Benchmarks

Intelligence: ●●○○○
Math: ●●○○○
Coding: ●○○○○
MMLU Pro: 67.6%
GPQA: 40.9%
HLE: 4.6%
SciCode: 26.7%
AIME: 17.3%
MATH 500: 64.9%
LiveCodeBench: 23.2%
HumanEval: 81.2%

Llama 3.1 Instruct 70B