Command Palette

Search for a command to run...

Llama 3 Instruct 8B

Meta

Open

Context

Release Date
Apr 18, 2024
Knowledge Cutoff
Mar 01, 2023
Window
8,192

PricingPer 1M tokens

Input
$0.060000000000000005
Output
$0.14
Blended 3:1
$0.085

Capabilities

Speed
103 t/s
Input
Output
Reasoning tokens

Latency

TTFT
0.34 ms
500 token response
5.20 s

Benchmarks

Intelligence
○○○○
Math
○○○○
Coding
○○○○
MMLU Pro
40.5%
GPQA
29.6%
HLE
5.1%
SciCode
11.9%
AIME
0.0%
MATH 500
49.9%
LiveCodeBench
9.6%
HumanEval
70.5%

Llama 3 Instruct 8B