gemini-2.5-flash

Common Name: Gemini 2.5 Flash

Google
-10%On SaleReleased on Jun 18 12:00 AMKnowledge Cutoff Apr 1 12:00 AMTool Invocation

Google's most efficient workhorse model designed for speed and low-cost. Improved across key benchmarks for reasoning, multimodality, code and long context while being 20-30% more efficient.

Specifications

Context1,048,576
Maximum Output65,536
Inputtext, image, audio, video
Outputtext

Performance (7-day Average)

Uptime
TPS
RURT

Pricing

Input$0.27/MTokens
Input Audio$0.90/MTokens
Output$2.25/MTokens

Usage Statistics

No usage data available for this model during the selected period
View your usage statistics for this model

Similar Models

$0.45/$2.70/M
ctx1.0Mmax64Kavailtps
InOutCap

Preview of Google's next-generation Gemini 3 Flash model, optimized for speed with frontier intelligence combined with superior search and grounding capabilities.

$0.09/$0.36/M
ctx1.0Mmax66Kavailtps
InOutCap

A lightweight version of Gemini 2.5 Flash optimized for speed and cost efficiency with 1M token context support.

$1.13/$9.00/M
ctx1.0Mmax66Kavailtps
InOutCap

Google's most intelligent AI model with adaptive thinking capabilities. Among the world's best models for coding and tasks requiring advanced reasoning.

$1.80/$10.80/M
ctx1.0Mmax64Kavailtps

Preview of Google's next-generation Gemini 3 Pro model with enhanced reasoning and multimodal capabilities.

Documentation

No documentation available
This model (gemini-2.5-flash) uses a dedicated API. Please refer to the official documentation for usage examples.