Quick facts

Model ID nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Source NanoGPT

Context Window 16384

Pricing $0.36 input / $0.41 output per 1M tokens

Capabilities temperature control

Model overview

Nvidia Nemotron 70b is an AI model from NanoGPT with 16384 token context window and text input support.

Published pricing is $0.36 input and $0.41 output per 1M tokens.

Model ID nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Provider NanoGPT

Family nemotron

Status -

Knowledge Cutoff -

Release Date 2025-04-15

Input Modalities text

Output Modalities text

Context Window 16384

Input Limit 16384

Output Limit 8192

Tool Calling No

Reasoning No

Structured Output No

Temperature Control Yes

Open Weights No

Input Cost / 1M tokens $0.36

Output Cost / 1M tokens $0.41

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens -

Cache Write Cost / 1M tokens -