whichllm — Browse and compare AI model specs and pricing

Venice AI

Llama 3.2 3B model ID, context window & pricing

llama

Quick facts

Model ID llama-3.2-3b
Source Venice AI
Context Window 128000
Pricing $0.15 input / $0.60 output per 1M tokens
Capabilities tool calling, open weights

Model overview

Llama 3.2 3B is an AI model from Venice AI with 128000 token context window and text input support.

Published pricing is $0.15 input and $0.60 output per 1M tokens.

  • Workloads that use text inputs with text outputs.
  • Agent and tool workflows that need function calling.
Model ID llama-3.2-3b
Provider Venice AI
Family llama
Status -
Knowledge Cutoff -
Release Date 2024-10-03
Input Modalities text
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 4096
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control -
Open Weights Yes
Input Cost / 1M tokens $0.15
Output Cost / 1M tokens $0.60
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -