Qwen2.5-VL-7B-Instruct
Qwen/Qwen2.5-VL-7B-Instruct
Qwen2.5-VL is a new member of the Qwen series, equipped with powerful visual comprehension capabilities. It can analyze text, charts, and layouts within images, understand long videos, and capture events. It is capable of reasoning, manipulating tools, supporting multi-format object localization, and generating structured outputs. The model has been optimized for dynamic resolution and frame rate training in video understanding, and has improved the efficiency of the visual encoder.

Details
Model Provider
Qwen
Type
text
Sub Type
chat
Size
0
Publish Time
Jan 28, 2025
Input Price
$
0.05
/ M Tokens
Output Price
$
0.05
/ M Tokens
Context length
32768
Tags
7B,32K