whichllm — Browse and compare AI model specs and pricing

Azure Cognitive Services

Meta-Llama-3.1-8B-Instruct model ID, context window & pricing

llama

Quick facts

Model ID meta-llama-3.1-8b-instruct
Source Azure Cognitive Services
Context Window 128000
Pricing $0.30 input / $0.61 output per 1M tokens
Capabilities tool calling, temperature control, open weights

Model overview

Meta-Llama-3.1-8B-Instruct is an AI model from Azure Cognitive Services with 128000 token context window and text input support.

Published pricing is $0.30 input and $0.61 output per 1M tokens.

  • Workloads that use text inputs with text outputs.
  • Agent and tool workflows that need function calling.
Model ID meta-llama-3.1-8b-instruct
Provider Azure Cognitive Services
Family llama
Status -
Knowledge Cutoff 2023-12
Release Date 2024-07-23
Input Modalities text
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 32768
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.30
Output Cost / 1M tokens $0.61
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -