whichllm — Browse and compare AI model specs and pricing

evroc

Llama 3.3 70B

llama

Model ID nvidia/Llama-3.3-70B-Instruct-FP8
Provider evroc
Family llama
Status -
Knowledge Cutoff -
Release Date 2024-12-01
Input Modalities text
Output Modalities text
Context Window 131072
Input Limit -
Output Limit 32768
Tool Calling No
Reasoning No
Structured Output -
Temperature Control -
Open Weights Yes
Input Cost / 1M tokens $1.18
Output Cost / 1M tokens $1.18
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -