Llama 3.2 Instruct 1B

Meta

Open

Context

Release Date: Sep 25, 2024
Knowledge Cutoff: Dec 01, 2023
Window: 128k

PricingPer 1M tokens

Input: $0.04
Output: $0.08
Blended 3:1: $0.05

Capabilities

Speed: 177 t/s
Input
Output
Reasoning tokens

Latency

TTFT: 0.29 ms
500 token response: 3.12 s

Benchmarks

Intelligence: ○○○○○
Math: ○○○○○
Coding: ○○○○○
MMLU Pro: 20.0%
GPQA: 19.6%
HLE: 5.3%
SciCode: 1.7%
AIME: 0.0%
MATH 500: 14.0%
LiveCodeBench: 1.9%
HumanEval: 40.2%

Llama 3.2 Instruct 1B