$0.22/$0.66/M
ctx128Kmax33Kavail—tps—
InOut
Qwen3 235B model with 22B active parameters optimized for throughput, hosted on TogetherAI.
OpenAI's open-weight 120B model for production and high reasoning use cases, hosted on TogetherAI.
Qwen3 235B model with 22B active parameters optimized for throughput, hosted on TogetherAI.
Meta's Llama 3.1 8B optimized for fast inference on TogetherAI.
Meta's Llama 2 7B chat model for conversational AI, hosted on TogetherAI.
Mistral AI's 7B instruction-tuned model v0.2 with improved performance, hosted on TogetherAI.