Model Comparison

Compare two models side by side to make informed decisions based on pricing, specifications, and performance.

This is our first general-availability realtime model, capable of responding to audio and text inputs in realtime over WebRTC, WebSocket, or SIP connections.

Pricing

Input
$4.40/MTokens
Cached Input
$0.55/MTokens
Output
$17.60/MTokens
Input Audio
$35.20/MTokens
cached input audio
$0.55/MTokens
Output Audio
$70.40/MTokens
Input Image
$5.50/MTokens

Specifications

Context
32,000
Maximum Output
4,096
Inputtext, audio, image
Outputtext, audio

No model selected

AI Model Comparison - Compare GPT-4, Claude, Gemini & More