fireworks/models/llama-v3p1-70b-instruct

Common Name: Llama 3.1 70B Instruct

Released on Oct 16, 2025 12:00 AM

Supported: Tool Invocation

The Meta Llama 3.1 collection of multilingual large language models (LLMs) comprises pretrained and instruction-tuned generative models in 8B, 70B, and 405B sizes. The Llama 3.1 instruction-tuned, text-only models (8B, 70B, 405B) are optimized for multilingual dialogue use cases and outperform many of the available open-source and closed chat models on common industry benchmarks.

Specifications

Context: 128K

Input: text

Output: text

Pricing

Input: $0.99/MTokens

Output: $0.99/MTokens
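
As a rough illustration of how the listed prices translate into per-request cost, here is a minimal sketch. It assumes that "MTokens" means one million tokens and that billing scales linearly with token count; the function name `estimate_cost` and the constants are hypothetical and not part of any provider API.

```python
from decimal import Decimal

# Listed prices in USD per one million tokens (taken from the Pricing section).
INPUT_PRICE_PER_MTOK = Decimal("0.99")
OUTPUT_PRICE_PER_MTOK = Decimal("0.99")

# Assumption: "MTokens" denotes 1,000,000 tokens.
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 20,000-token completion.
    print(f"${estimate_cost(100_000, 20_000):.4f}")
```

Because input and output are priced identically for this model, only the total token count matters here. The two rates are kept separate so the same sketch could be reused for models whose input and output prices differ.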

Similar Models

DeepSeek V3.1 Terminus

Price (input/output): $0.62/$1.85 per MTokens

Context: 160K

DeepSeek-V3.1-Terminus is an updated version of DeepSeek-V3.1 with enhanced language consistency, reduced mixed Chinese-English text, and optimized Code Agent and Search Agent performance.

Kimi K2 Instruct 0905

Price (input/output): $0.66/$2.75 per MTokens

Context: 256K

Kimi K2 0905 is an updated version of Kimi K2, a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Kimi K2 0905 has improved coding abilities and agentic tool use, and a longer (262K) context window.

DeepSeek V3.1

Price (input/output): $0.62/$1.85 per MTokens

Context: 160K

DeepSeek-V3.1 is post-trained on top of DeepSeek-V3.1-Base, which is built upon the original V3 base checkpoint through a two-phase long-context extension approach, following the methodology outlined in the original DeepSeek-V3 report. We have expanded our dataset by collecting additional long documents and substantially extending both training phases. The 32K extension phase has been increased 10-fold to 630B tokens, while the 128K extension phase has been extended by 3.3x to 209B tokens. Additionally, DeepSeek-V3.1 is trained using the UE8M0 FP8 scale data format to ensure compatibility with microscaling data formats.

Qwen3 Coder 480B A35B Instruct

Price (input/output): $0.50/$1.98 per MTokens

Context: 256K

Qwen3's most agentic code model to date.