fireworks/models/llama4-scout-instruct-basic

Common Name: Llama 4 Scout Instruct (Basic)

Fireworks
Released on Oct 16 12:00 AMTool Invocation

The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.

Specifications

Context1,000,000
Inputtext, image
Outputtext

Performance (7-day Average)

Uptime
TPS
RURT

Pricing

Input$0.17/MTokens
Output$0.66/MTokens

Usage Statistics

No usage data available for this model during the selected period
View your usage statistics for this model

Similar Models

$0.24/$0.97/M
ctx1.0Mmaxavailtps
InOutCap

The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.

$0.17/$0.66/M
ctx128Kmaxavailtps
InOutCap

Welcome to the gpt-oss series, OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. gpt-oss-120b is used for production, general purpose, high reasoning use-cases that fits into a single H100 GPU.

$0.24/$0.97/M
ctx256Kmaxavailtps
InOutCap

Latest Qwen3 thinking model, competitive against the best close source models in Jul 2025.

$0.24/$0.97/M
ctx256Kmaxavailtps
InOutCap

Updated FP8 version of Qwen3-235B-A22B non-thinking mode, with better tool use, coding, instruction following, logical reasoning and text comprehension capabilities

Documentation

No documentation available
This model (fireworks/models/llama4-scout-instruct-basic) uses a dedicated API. Please refer to the official documentation for usage examples.