A high-speed version of Doubao Seed 1.6 optimized for fast inference with multimodal support. Supports 256K context with excellent performance on text, image, and video understanding tasks.
ark-doubao-seed-1.6-vision-250815
Common Name: Doubao Seed 1.6 Vision (250815)
ByteDance's multimodal model with vision capabilities for text, image, and video understanding.
Specifications
Context128,000
Inputtext, image, video
Outputtext
Performance (7-day Average)
Uptime
TPS
RURT
Pricing
< 32K
Input¥0.80/MTokens
Output¥8.00/MTokens
32K-128K
Input¥1.20/MTokens
Output¥16.00/MTokens
128K-256K
Input¥2.40/MTokens
Output¥24.00/MTokens
256K+
Input¥2.40/MTokens
Output¥24.00/MTokens
Usage Statistics
No usage data available for this model during the selected period
View your usage statistics for this model
Similar Models
¥0.15/¥1.50/M
ctx256Kmax32Kavail—tps—
InOutCap
¥0.80/¥8.00/M
ctx256Kmax32Kavail—tps—
InOutCap
A reasoning-enhanced version of Doubao Seed 1.6 with extended thinking capabilities for complex problem-solving. Features 256K context window and advanced multimodal understanding.
¥0.15/¥1.50/M
ctx256Kmax16Kavail—tps—
InOutCap
An earlier fast-inference variant of Doubao Seed 1.6 with 256K context support. Optimized for speed while maintaining strong multimodal capabilities across text, image, and video inputs.
¥0.80/¥8.00/M
ctx256Kmax16Kavail—tps—
InOutCap
Previous version of the thinking-enabled Doubao Seed 1.6 model designed for enhanced reasoning and complex cognitive tasks. Supports 256K context with multimodal input processing.
Documentation
No documentation available
This model (ark-doubao-seed-1.6-vision-250815) uses a dedicated API. Please refer to the official documentation for usage examples.