Models

Products

Pricing

Docs

Blog

About

Contact

Back to Models

Qwen2.5-VL-7B-Instruct

Qwen/Qwen2.5-VL-7B-Instruct

Qwen2.5-VL is a new member of the Qwen series, equipped with powerful visual comprehension capabilities. It can analyze text, charts, and layouts within images, understand long videos, and capture events. It is capable of reasoning, manipulating tools, supporting multi-format object localization, and generating structured outputs. The model has been optimized for dynamic resolution and frame rate training in video understanding, and has improved the efficiency of the visual encoder.

API Usage

cURL

Python

JavaScript

curl --request POST \
  --url https://api.ap.siliconflow.com/v1/chat/completions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "model": "Qwen/Qwen2.5-VL-7B-Instruct",
  "stream": false,
  "max_tokens": 512,
  "temperature": 0.7,
  "top_p": 0.7,
  "top_k": 50,
  "frequency_penalty": 0.5,
  "n": 1,
  "stop": []
}'

Details

Model Provider

Qwen

Type

text

Sub Type

chat

Size

0

Publish Time

Jan 28, 2025

Input Price

$

0.05

/ M Tokens

Output Price

$

0.05

/ M Tokens

Context length

32768

Tags

7B,32K

Open in Playround

API Reference

Ready to accelerate your AI development?

Ready to accelerate your AI development?

PAGES

MODELS

PRODUCTS

© 2025 SiliconFlow Technology PTE. LTD.

·

PAGES

MODELS

PRODUCTS

© 2025 SiliconFlow Technology PTE. LTD.

·

PAGES

MODELS

PRODUCTS

© 2025 SiliconFlow Technology PTE. LTD.

·