fireworks/models/llama4-scout-instruct-basic

Common Name: Llama 4 Scout Instruct (Basic)

Released on Oct 16, 2025 12:00 AM

Supported

Tool Invocation

The Llama 4 models are natively multimodal AI models that enable text and multimodal experiences. They use a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.

Specifications

Context: 1000K

Input: text, image

Output: text

Pricing

Input: $0.17/MTokens

Output: $0.66/MTokens
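
As a rough illustration of how the per-million-token prices above translate into a request cost, here is a minimal sketch. The helper name `estimate_cost_usd` is hypothetical, and the calculation assumes simple linear billing on input and output token counts. The listing does not say how image inputs are counted or whether any discounts apply, so treat the result as an estimate only.

```python
from decimal import Decimal

# Prices copied from the Pricing section of this listing (USD per million tokens).
INPUT_USD_PER_MTOK = Decimal("0.17")
OUTPUT_USD_PER_MTOK = Decimal("0.66")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: linear cost estimate from token counts.

    Assumes billing is strictly proportional to token counts; this is an
    assumption, not something the listing states.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_USD_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 50,000-token completion.
    print(estimate_cost_usd(200_000, 50_000))
```

Under these assumptions, 200,000 input tokens and 50,000 output tokens come to (200,000 × 0.17 + 50,000 × 0.66) / 1,000,000 = $0.067.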

Similar Models

Llama 4 Maverick Instruct (Basic)

$0.24/$0.97/M

ctx 1.0M, max —, avail —, tps —

OpenAI gpt-oss-120b

$0.17/$0.66/M

ctx 128K, max —, avail —, tps —

Welcome to the gpt-oss series, OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. gpt-oss-120b is intended for production, general-purpose, high-reasoning use cases and fits on a single H100 GPU.

Qwen3 235B A22B Thinking 2507

$0.24/$0.97/M

ctx 256K, max —, avail —, tps —

Latest Qwen3 thinking model, competitive with the best closed-source models as of July 2025.

Qwen3 235B A22B Instruct 2507

$0.24/$0.97/M

ctx 256K, max —, avail —, tps —

Updated FP8 version of Qwen3-235B-A22B in non-thinking mode, with better tool use, coding, instruction following, logical reasoning, and text comprehension capabilities.