yi-vision-v2

Common Name: YI Vision V2

01AI
Released on Feb 17 12:00 AMTool Invocation

01.AI's multimodal vision-language model for image understanding and analysis.

Specifications

Context16,000
Maximum Output16,000
Inputtext, image
Outputtext

Performance (7-day Average)

Uptime
TPS
RURT

Pricing

Input¥6.00/MTokens
Output¥6.00/MTokens

Usage Statistics

No usage data available for this model during the selected period
View your usage statistics for this model

Similar Models

¥0.99/¥0.99/M
ctx16Kmax16Kavailtps
InOutCap

01.AI's fast and efficient language model for general-purpose tasks.

$3.30/$4.40/M
ctx16Kmax4Kavailtps
InOutCap

GPT-3.5 Turbo variant with extended 16K token context window for longer conversations and documents.

$4.40/$17.60/M
ctx32Kmax4Kavailtps
InOut

This is our first general-availability realtime model, capable of responding to audio and text inputs in realtime over WebRTC, WebSocket, or SIP connections.

$4.40/$17.60/M
ctx32Kmax4Kavailtps
InOut

This is our first general-availability realtime model, capable of responding to audio and text inputs in realtime over WebRTC, WebSocket, or SIP connections.

Documentation

No documentation available
This model (yi-vision-v2) uses a dedicated API. Please refer to the official documentation for usage examples.