Models

Products

Pricing

Docs

Blog

About

Contact

Back to Models

Qwen2.5-VL-32B-Instruct

Qwen/Qwen2.5-VL-32B-Instruct

Qwen2.5-VL-32B-Instruct is a multimodal large language model released by the Qwen team, part of the Qwen2.5-VL series. This model is not only proficient in recognizing common objects but is highly capable of analyzing texts, charts, icons, graphics, and layouts within images. It acts as a visual agent that can reason and dynamically direct tools, capable of computer and phone use. Additionally, the model can accurately localize objects in images, and generate structured outputs for data like invoices and tables. Compared to its predecessor Qwen2-VL, this version has enhanced mathematical and problem-solving abilities through reinforcement learning, with response styles adjusted to better align with human preferences

API Usage

cURL

Python

JavaScript

curl --request POST \
  --url https://api.ap.siliconflow.com/v1/chat/completions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "model": "Qwen/Qwen2.5-VL-32B-Instruct",
  "stream": false,
  "max_tokens": 512,
  "temperature": 0.7,
  "top_p": 0.7,
  "top_k": 50,
  "frequency_penalty": 0.5,
  "n": 1,
  "stop": []
}'

Details

Model Provider

Qwen2.5

Type

text

Sub Type

chat

Size

32

Publish Time

Mar 24, 2025

Input Price

$

0.27

/ M Tokens

Output Price

$

0.27

/ M Tokens

Context length

131072

Tags

32B,128K

Open in Playround

API Reference

Ready to accelerate your AI development?

Ready to accelerate your AI development?

PAGES

MODELS

PRODUCTS

© 2025 SiliconFlow Technology PTE. LTD.

·

PAGES

MODELS

PRODUCTS

© 2025 SiliconFlow Technology PTE. LTD.

·

PAGES

MODELS

PRODUCTS

© 2025 SiliconFlow Technology PTE. LTD.

·