Llama 3.1 8B

llama-3.1-8b-instant
Context
131K
Max output
131K
Input price
$0.05
/1M tokens
Output price
$0.08
/1M tokens
Capabilities
vision
tool call
structured output
reasoning
json mode
streaming
fine tuning
batch
Details
Provider Groq
Familyllama-3.1
Statusactive
Input modalitiestext
Output modalitiestext
Knowledge cutoff
Release date
Deprecation date
Sourceofficial
Last updated2026-03-21
Max input
Pricing per 1M tokens
Input
$0.05
Output
$0.08
Cached
Batch in
Batch out
API
GET/v1/models/groq/llama-3.1-8b-instant

Llama 3.1 8B

Changes · 1 entries
llama-3.1-8b-instantcreate8b78603Mar 21, 2026, 05:16 AM