Model Comparison

Compare two models side by side to make informed decisions based on pricing, specifications, and performance.

Tool Invocation

Google's mid-size multimodal model with 2M token context for text, image, audio, and video understanding.

Pricing

Input
$1.13/MTokens
Output
$4.50/MTokens
Input Audio
$1.13/MTokens
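
To show how the per-token prices above turn into a per-request figure, here is a minimal sketch in Python. It assumes "MTokens" means one million tokens, and it treats audio input as ordinary input tokens because the two are listed at the same rate. The function name `estimate_cost` and the sample token counts are hypothetical, chosen only for illustration.

```python
# Prices copied from the Pricing list above, in US dollars per million tokens.
# Assumption: "MTokens" means 1,000,000 tokens.
INPUT_PRICE_PER_MTOK = 1.13   # also the listed rate for audio input
OUTPUT_PRICE_PER_MTOK = 4.50
TOKENS_PER_MTOK = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in US dollars for a single request."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / TOKENS_PER_MTOK * INPUT_PRICE_PER_MTOK
    output_cost = output_tokens / TOKENS_PER_MTOK * OUTPUT_PRICE_PER_MTOK
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: 100,000 input tokens and 2,000 output tokens.
    print(f"${estimate_cost(100_000, 2_000):.4f}")
```

For the hypothetical request shown, the input side contributes about $0.113 and the output side about $0.009, so the estimate is roughly $0.12.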

Specifications

Context
2,097,152
Maximum Output
8,192
Input
text, image, audio, video
Output
text

