Latest Qwen3 thinking model, competitive against the best close source models in Jul 2025.
fireworks/models/gpt-oss-120b
Common Name: OpenAI gpt-oss-120b
Welcome to the gpt-oss series, OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. gpt-oss-120b is used for production, general purpose, high reasoning use-cases that fits into a single H100 GPU.
Specifications
Performance (7-day Average)
Pricing
Usage Statistics
Similar Models
Updated FP8 version of Qwen3-235B-A22B non-thinking mode, with better tool use, coding, instruction following, logical reasoning and text comprehension capabilities
Latest Qwen3 state of the art model, 235B with 22B active parameter model
The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models in 8B, 70B and 405B sizes. The Llama 3.1 instruction tuned text only models (8B, 70B, 405B) are optimized for multilingual dialogue use cases and outperform many of the available open source and closed chat models on common industry benchmarks.