DeepSeek R1 Distill Llama 8B

DeepSeek

Reasoning, Open

Context

Release Date: Jan 20, 2025
Window: 128k

Pricing (per 1M tokens)

Input: $0.04
Output: $0.04
Blended 3:1: $0.04
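
The blended figure can be reproduced with a short sketch. It assumes "Blended 3:1" means a weighted average over three input tokens for every output token; the page does not spell out the formula, and the function name `blended_price` is hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for an input:output token mix.

    Assumption: "3:1" means 3 parts input tokens to 1 part output tokens.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Prices per 1M tokens as listed in the pricing panel.
price = blended_price(0.04, 0.04)
print(f"${price:.2f} per 1M tokens")
```

Because input and output cost the same here, the blended price equals $0.04 for any ratio; the weighting only matters for models whose output tokens are priced differently from input tokens.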

Capabilities

Speed: 57 t/s
Input
Output
Reasoning tokens

Latency

TTFT: 0.73 s
500 token response: 44.94 s
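
The response time is much longer than 500 tokens at the listed speed would take on its own. The sketch below does the arithmetic, assuming a steady 57 t/s and leaving TTFT at zero because it is small next to the total. One plausible reading, not confirmed by the page, is that the extra time is spent generating reasoning tokens before the visible answer. The helper names are hypothetical.

```python
def generation_time_s(tokens: int, tokens_per_s: float) -> float:
    """Seconds needed to stream `tokens` at a steady `tokens_per_s`."""
    return tokens / tokens_per_s


def implied_tokens(total_s: float, tokens_per_s: float, ttft_s: float = 0.0) -> float:
    """Tokens that fit into `total_s` after waiting `ttft_s` for the first one."""
    return (total_s - ttft_s) * tokens_per_s


SPEED = 57.0        # t/s, from the Speed figure
RESPONSE_S = 44.94  # listed time for a 500 token response

print(f"500 tokens alone: {generation_time_s(500, SPEED):.1f} s")
print(f"Tokens implied by {RESPONSE_S} s: {implied_tokens(RESPONSE_S, SPEED):.0f}")
```

Under these assumptions, 500 tokens alone would take under 9 seconds, while 44.94 s corresponds to roughly 2,500 generated tokens in total.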

Benchmarks

Reasoning: ●●○○○
Math: ●●●○○
Coding: ○○○○

MMLU Pro: 54.3%
GPQA: 30.2%
HLE: 4.2%
SciCode: 11.9%
AIME: 33.3%
MATH 500: 85.3%
LiveCodeBench: 23.3%
HumanEval: 83.5%
