Quick facts

Model ID nvidia/Llama-3.3-70B-Instruct-FP8

Source evroc

Context Window 131072

Pricing $1.18 input / $1.18 output per 1M tokens

Capabilities open weights

Model overview

Llama 3.3 70B is an AI model from evroc with 131072 token context window and text input support.

Published pricing is $1.18 input and $1.18 output per 1M tokens.

Model ID nvidia/Llama-3.3-70B-Instruct-FP8

Provider evroc

Family llama

Status -

Knowledge Cutoff -

Release Date 2024-12-01

Input Modalities text

Output Modalities text

Context Window 131072

Input Limit -

Output Limit 32768

Tool Calling No

Reasoning No

Structured Output -

Temperature Control -

Open Weights Yes

Input Cost / 1M tokens $1.18

Output Cost / 1M tokens $1.18

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens -

Cache Write Cost / 1M tokens -