whichllm — Browse and compare AI model specs and pricing

Nvidia

Llama 3.1 8B Instruct model ID, context window & pricing

llama

Quick facts

Model ID meta/llama-3.1-8b-instruct
Source Nvidia
Context Window 16000
Pricing -
Capabilities tool calling, temperature control, open weights

Model overview

Llama 3.1 8B Instruct is an AI model from Nvidia with 16000 token context window and text input support.

Public token pricing is not listed for this model in the current catalog source.

  • Workloads that use text inputs with text outputs.
  • Agent and tool workflows that need function calling.
Model ID meta/llama-3.1-8b-instruct
Provider Nvidia
Family llama
Status -
Knowledge Cutoff 2023-12
Release Date 2025-01-01
Input Modalities text
Output Modalities text
Context Window 16000
Input Limit -
Output Limit 4096
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens -
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -