Quick facts

Model ID llama-3.3-70b

Source Venice AI

Context Window 128000

Pricing $0.70 input / $2.80 output per 1M tokens

Capabilities tool calling, open weights

Model overview

Llama 3.3 70B is an AI model from Venice AI with 128000 token context window and text input support.

Published pricing is $0.70 input and $2.80 output per 1M tokens.

Model ID llama-3.3-70b

Provider Venice AI

Family llama

Status -

Knowledge Cutoff -

Release Date 2025-04-06

Input Modalities text

Output Modalities text

Context Window 128000

Input Limit -

Output Limit 4096

Tool Calling Yes

Reasoning No

Structured Output -

Temperature Control -

Open Weights Yes

Input Cost / 1M tokens $0.70

Output Cost / 1M tokens $2.80

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens -

Cache Write Cost / 1M tokens -