whichllm — Browse and compare AI model specs and pricing

Nvidia Nemotron 70b model ID, context window & pricing

Quick facts

Model ID: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
Source: NanoGPT
Context Window: 16,384 tokens
Pricing: $0.36 input / $0.41 output per 1M tokens
Capabilities: temperature control

Model overview

Nvidia Nemotron 70b is an AI model offered through NanoGPT with a 16,384-token context window and text input support.

Published pricing is $0.36 input and $0.41 output per 1M tokens.
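
As a rough illustration of what the published rates mean per request, the sketch below multiplies token counts by the listed prices. The function name `estimate_cost_usd` and the example token counts are hypothetical; actual billing may differ (for example through rounding or minimum charges), so treat the result as an estimate only.

```python
# Prices taken from the listing, in USD per 1M tokens.
INPUT_USD_PER_M = 0.36
OUTPUT_USD_PER_M = 0.41


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Example: a 12,000-token prompt with a 2,000-token completion.
print(f"{estimate_cost_usd(12_000, 2_000):.6f}")
```

At these rates, a full million tokens in each direction would come to $0.77 combined.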

  • Workloads that use text inputs with text outputs.
Model ID: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
Provider: NanoGPT
Family: nemotron
Status: -
Knowledge Cutoff: -
Release Date: 2025-04-15
Input Modalities: text
Output Modalities: text
Context Window: 16,384 tokens
Input Limit: 16,384 tokens
Output Limit: 8,192 tokens
Tool Calling: No
Reasoning: No
Structured Output: No
Temperature Control: Yes
Open Weights: No
Input Cost / 1M tokens: $0.36
Output Cost / 1M tokens: $0.41
Reasoning Cost / 1M tokens: -
Cache Read Cost / 1M tokens: -
Cache Write Cost / 1M tokens: -
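
The listed limits can be used to sanity-check a request before sending it. The sketch below assumes that prompt and completion tokens share the 16,384-token context window, which the listing does not state explicitly; the helper name `max_output_tokens` is hypothetical.

```python
# Limits taken from the listing, in tokens.
CONTEXT_WINDOW = 16_384
INPUT_LIMIT = 16_384
OUTPUT_LIMIT = 8_192


def max_output_tokens(input_tokens: int) -> int:
    """Return how many completion tokens fit after a prompt of the given size.

    Assumption (not confirmed by the listing): prompt and completion
    together must fit inside the context window.
    """
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    if input_tokens > INPUT_LIMIT:
        raise ValueError(f"prompt exceeds the {INPUT_LIMIT}-token input limit")
    # The completion is capped by both the output limit and the remaining window.
    return min(OUTPUT_LIMIT, CONTEXT_WINDOW - input_tokens)


print(max_output_tokens(4_000))
print(max_output_tokens(12_000))
```

Under that assumption, a short prompt is bounded by the output limit, while a long prompt is bounded by whatever remains of the context window.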