ComfyUI_Qwen2-VL-Instruct

This is an implementation of [a/Qwen2-VL-Instruct](https://github.com/QwenLM/Qwen2-VL) by [a/ComfyUI](https://github.com/comfyanonymous/ComfyUI), which includes, but is not limited to, support for text-based queries, video queries, single-image queries, and multi-image queries to generate captions or responses.

97
Stars
IuvenisSapiens
Author
4/2/2025
Last Update
760
Days

Category

image processing

Description

This is an implementation of [a/Qwen2-VL-Instruct](https://github.com/QwenLM/Qwen2-VL) by [a/ComfyUI](https://github.com/comfyanonymous/ComfyUI), which includes, but is not limited to, support for text-based queries, video queries, single-image queries, and multi-image queries to generate captions or responses.

Technical Information

Install Type:git-clone
Node ID:24490

Related Nodes

Discover more nodes in the same category or by the same author.

View all image processing nodes →