doubao-1.5-thinking-vision-pro-250428

Common Name: Doubao 1.5 Thinking Vision Pro (250428)

ByteDance
Released on May 23 12:00 AM
Tool Invocation

Premium multimodal model combining thinking capabilities with advanced vision understanding. Supports text, image, and video inputs with 64K context for sophisticated reasoning over visual content.

Specifications

Context: 64,000
Maximum Output: 16,384
Input: text, image, video
Output: text
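
The context and maximum output figures above together bound how much a single request can generate. The sketch below is a rough illustration, not the provider's documented behavior: it assumes that input and output tokens share the 64,000-token context window, which this page does not confirm, and the function name `max_output_budget` is hypothetical.

```python
# Limits taken from the Specifications above.
CONTEXT_WINDOW = 64_000
MAX_OUTPUT = 16_384


def max_output_budget(input_tokens: int) -> int:
    """Return the largest output length a request could ask for.

    Assumption (not confirmed by this page): input and output tokens
    share the same context window.
    """
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - input_tokens
    # Never exceed the model's output cap, never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for n in (10_000, 60_000, 70_000):
        print(n, max_output_budget(n))
```

If the service counts the context window differently (for example, input only), the `remaining` calculation would need to change accordingly.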

Performance (7-day Average)

Uptime
TPS
RURT

Pricing

Input: ¥3.00/MTokens
Output: ¥9.00/MTokens
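
As a worked example of the per-million-token rates above, the following sketch estimates the cost of one request. The function name `estimate_cost_cny` is hypothetical, and the example assumes that any reasoning tokens the model emits are billed at the output rate, which this page does not state.

```python
from decimal import Decimal

# Rates from the Pricing section, in CNY per million tokens.
INPUT_PRICE_PER_MTOKEN = Decimal("3.00")
OUTPUT_PRICE_PER_MTOKEN = Decimal("9.00")
TOKENS_PER_MTOKEN = Decimal(1_000_000)


def estimate_cost_cny(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of a single request in CNY."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_MTOKEN
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_MTOKEN
    )
    return total / TOKENS_PER_MTOKEN


if __name__ == "__main__":
    # 48,000 input tokens and 16,000 output tokens.
    print(estimate_cost_cny(48_000, 16_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without rounding drift when summed over many calls.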

Similar Models

Pricing (input/output): ¥2.00/¥8.00 per MTokens
Context: 128K
Maximum Output: 16K

DeepSeek's reasoning-focused model hosted on ByteDance infrastructure, optimized for complex problem-solving and logical reasoning tasks. Supports 128K context with strong analytical capabilities.

Pricing (input/output): ¥1.50/¥4.50 per MTokens
Context: 64K
Maximum Output: 16K

Lightweight vision-language model from the Doubao 1.5 series, balancing efficiency with multimodal understanding. Supports text and image inputs with 64K context for cost-effective visual tasks.

Pricing (input/output): ¥3.00/¥9.00 per MTokens
Context: 64K
Maximum Output: 16K

Advanced vision-language model with enhanced image understanding and analysis capabilities. Features 64K context window and excels at complex visual reasoning and multimodal tasks.

Pricing (input/output): ¥4.00/¥16.00 per MTokens
Context: 64K
Maximum Output: 16K

Professional thinking-enhanced model designed for complex reasoning and analytical tasks. Supports 64K context with text and image inputs, excelling at multi-step problem solving.

Documentation

No documentation available
This model (doubao-1.5-thinking-vision-pro-250428) uses a dedicated API. Please refer to the official documentation for usage examples.