novita/qwen-2-vl-72b-instruct

public

Published on 3/5/2025

qwen-2-vl-72b-instruct

Qwen-2-VL-72B-Instruct is a multimodal model handling images, videos, and text across languages. This 72B parameter model excels in visual analysis, 20+ minute video understanding, device control, and multilingual text recognition.

Models

qwen-2-vl-72b-instruct

anthropic

chat

edit

apply