gpt-realtime-2025-08-28

Common Name: GPT Realtime

OpenAI
Released on Oct 7 12:00 AM

This is our first general-availability realtime model, capable of responding to audio and text inputs in realtime over WebRTC, WebSocket, or SIP connections.

Specifications

Context32,000
Maximum Output4,096
Inputtext, audio, image
Outputtext, audio

Performance (7-day Average)

Uptime
TPS
RURT

Pricing

Input$4.40/MTokens
Cached Input$0.55/MTokens
Output$17.60/MTokens
Input Audio$35.20/MTokens
cached input audio$0.55/MTokens
Output Audio$70.40/MTokens
Input Image$5.50/MTokens

Usage Statistics

No usage data available for this model during the selected period
View your usage statistics for this model

Similar Models

$3.30/$4.40/M
ctx16Kmax4Kavailtps
InOutCap

GPT-3.5 Turbo variant with extended 16K token context window for longer conversations and documents.

$4.40/$17.60/M
ctx32Kmax4Kavailtps
InOut

This is our first general-availability realtime model, capable of responding to audio and text inputs in realtime over WebRTC, WebSocket, or SIP connections.

$2.20/$2.20/M
ctx16Kmax4Kavailtps
InOut

Base model for fine-tuning and legacy applications, replacing the original davinci base model.

$2.20/$8.80/M
ctx1.0Mmax33Kavailtps
InOutCap

GPT-4.1 is an enhanced version of GPT-4 with improved instruction following and multimodal capabilities for text and image understanding.

Documentation

No documentation available
This model (gpt-realtime-2025-08-28) uses a dedicated API. Please refer to the official documentation for usage examples.