whichllm — Browse and compare AI model specs and pricing

Nebius Token Factory

Llama-3.3-70B-Instruct

models.dev synced record

Model ID meta-llama/Llama-3.3-70B-Instruct
Provider Nebius Token Factory
Family -
Status -
Knowledge Cutoff 2025-08
Release Date 2025-12-05
Input Modalities text
Output Modalities text
Context Window 128000
Input Limit 120000
Output Limit 8192
Tool Calling Yes
Reasoning No
Structured Output Yes
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.13
Output Cost / 1M tokens $0.40
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.01
Cache Write Cost / 1M tokens $0.16