gemini-1.5-flash-8b

Common Name: Gemini 1.5 Flash 8B

Google

-10%On SaleReleased on Aug 27, 2024 12:00 AMKnowledge Cutoff Apr 1, 2024 12:00 AMTool Invocation

Compare Try in Chat

Google's smallest Gemini model optimized for speed and cost efficiency with multimodal support.

Specifications

Context1,048,576

Maximum Output8,192

Inputtext, image, audio, video

Outputtext

Performance (7-day Average)

Uptime

TPS

RURT

Pricing

Standard

Input$0.03/MTokens

Output$0.14/MTokens

Input Audio$0.03/MTokens

128K Tier

Input$0.07/MTokens

Output$0.27/MTokens

Input Audio$0.07/MTokens

Availability Trend (24h)

Performance Metrics (24h)

Usage Statistics

No usage data available for this model during the selected period

View your usage statistics for this model

View all usage statistics

Similar Models

Gemini 2.0 Flash Lite (001)

$0.07/$0.27/M

ctx1.0Mmax8Kavail—tps—

InOutCap

A lightweight and fast version of Gemini 2.0 Flash optimized for cost-effective multimodal tasks with lower latency.

Gemini 2.0 Flash Lite

$0.07/$0.27/M

ctx1.0Mmax8Kavail—tps—

InOutCap

Google's most cost-efficient multimodal model with 1M token context, designed for high-volume applications requiring speed and affordability.

Gemini 1.5 Flash (latest)

$0.07/$0.27/M

ctx1.0Mmax8Kavail—tps—

InOutCap

Google's fast, cost-efficient multimodal model with 1M token context for high-volume tasks.

Gemini 1.5 Flash (001)

$0.07/$0.27/M

ctx1.0Mmax8Kavail—tps—

InOutCap

Snapshot of Gemini 1.5 Flash with 1M token context for fast multimodal understanding.

Documentation

No documentation available

This model (gemini-1.5-flash-8b) uses a dedicated API. Please refer to the official documentation for usage examples.