whichllm — Browse and compare AI model specs and pricing

Alibaba (China)

DeepSeek R1 Distill Llama 8B

deepseek-thinking

Model ID deepseek-r1-distill-llama-8b
Provider Alibaba (China)
Family deepseek-thinking
Status -
Knowledge Cutoff -
Release Date 2025-01-01
Input Modalities text
Output Modalities text
Context Window 32768
Input Limit -
Output Limit 16384
Tool Calling Yes
Reasoning Yes
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens -
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -