gemini-1.5-flash-8b

Common Name: Gemini 1.5 Flash 8B

Google
On Sale: -10%
Released: Aug 27, 2024
Knowledge Cutoff: Apr 1, 2024
Capabilities: Tool Invocation

Google's smallest Gemini model, optimized for speed and cost efficiency, with multimodal support.

Specifications

Context: 1,048,576 tokens
Maximum Output: 8,192 tokens
Input: text, image, audio, video
Output: text
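
The limits above can be turned into a simple pre-flight check before a request is sent. The sketch below is illustrative only: the function name `fits_limits` is hypothetical, and it assumes the context window covers prompt and output tokens together, which this page does not state.

```python
# Limits copied from the Specifications section above.
CONTEXT_WINDOW = 1_048_576   # total tokens
MAX_OUTPUT_TOKENS = 8_192    # maximum tokens the model may generate


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the listed limits.

    ASSUMPTION: prompt and output tokens share the same context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    print(fits_limits(500_000, 4_096))    # within both limits
    print(fits_limits(1_048_000, 8_192))  # prompt plus output exceeds the window
```

Token counts must come from the provider's own tokenizer or token-counting endpoint; character or word counts are only rough approximations.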

Pricing

Standard
Input: $0.03/MTokens
Output: $0.14/MTokens
Input Audio: $0.03/MTokens

128K Tier
Input: $0.07/MTokens
Output: $0.27/MTokens
Input Audio: $0.07/MTokens
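
As a rough illustration of how the two pricing tiers translate into a per-request cost, here is a minimal sketch. The rates are copied from the listing above; everything else is an assumption: the function name `estimate_cost` is hypothetical, the tier switch is assumed to happen when the prompt exceeds 128,000 tokens, and the listed rates are assumed to be the ones actually billed (the page does not say whether the -10% sale is already reflected in them).

```python
from decimal import Decimal

# USD per million tokens, copied from the Pricing section above.
PRICES = {
    "standard": {
        "input": Decimal("0.03"),
        "output": Decimal("0.14"),
        "input_audio": Decimal("0.03"),
    },
    "128k": {
        "input": Decimal("0.07"),
        "output": Decimal("0.27"),
        "input_audio": Decimal("0.07"),
    },
}

# ASSUMPTION: the 128K tier applies once the prompt exceeds this many tokens.
TIER_THRESHOLD = 128_000
PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, audio_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request under the assumptions stated above."""
    prompt_tokens = input_tokens + audio_tokens
    tier = "128k" if prompt_tokens > TIER_THRESHOLD else "standard"
    rates = PRICES[tier]
    total = (
        input_tokens * rates["input"]
        + output_tokens * rates["output"]
        + audio_tokens * rates["input_audio"]
    )
    return total / PER_MILLION


if __name__ == "__main__":
    # 100,000 text input tokens and 10,000 output tokens at the standard tier.
    print(estimate_cost(100_000, 10_000))
```

`Decimal` is used instead of floats so that small per-token amounts add up without binary rounding drift; actual billing rules should be confirmed against the provider's pricing documentation.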

Similar Models

Input $0.07 / Output $0.27 per MTokens; context 1.0M; max output 8K
A lightweight and fast version of Gemini 2.0 Flash optimized for cost-effective multimodal tasks with lower latency.

Input $0.07 / Output $0.27 per MTokens; context 1.0M; max output 8K
Google's most cost-efficient multimodal model with 1M token context, designed for high-volume applications requiring speed and affordability.

Input $0.07 / Output $0.27 per MTokens; context 1.0M; max output 8K
Google's fast, cost-efficient multimodal model with 1M token context for high-volume tasks.

Input $0.07 / Output $0.27 per MTokens; context 1.0M; max output 8K
Snapshot of Gemini 1.5 Flash with 1M token context for fast multimodal understanding.

Documentation

No documentation available
This model (gemini-1.5-flash-8b) uses a dedicated API. Please refer to the official documentation for usage examples.