TA/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo

Common Name: Llama 3.1 405B Instruct Turbo

Released on Feb 17, 2025 12:00 AM

Meta's largest Llama 3.1 405B model optimized for fast inference on TogetherAI.

Context

128K

Inputtext

Outputtext

Collecting…

Input$3.85/MTokens

Output$3.85/MTokens

Availability Trend (24h)

$3.30/$7.70/M

ctx64Kmax8Kavail—tps—

InOut

DeepSeek's reasoning model trained via large-scale reinforcement learning, hosted on TogetherAI.

$2.20/$2.20/M

ctx128Kmax—avail—tps—

InOut

DeepSeek R1 reasoning model distilled to Llama 70B architecture, hosted on TogetherAI.

$0.17/$0.66/M

ctx128Kmax—avail—tps—

InOutCap

OpenAI's open-weight 120B model for production and high reasoning use cases, hosted on TogetherAI.

$0.06/$0.22/M

ctx128Kmax—avail—tps—

InOutCap

OpenAI's open-weight 20B model for lower latency and local use cases, hosted on TogetherAI.