Is your feature request related to a problem? Please describe.
I would like to use guidance with VLM models such as Qwen2-VL or Llama-3.2-Vision.
However, these are difficult to use because they rely on the recent Transformers classes MllamaForConditionalGeneration and Qwen2VLForConditionalGeneration, which cause an error with the current version of guidance.
Would it be possible to update guidance to support these model classes more broadly?
Additional context
Or is it possible that I am simply using it incorrectly? If anyone has solved this, please share your solution. A minimal sketch of what I am attempting is shown below.
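For illustration only: the checkpoint name and exact call pattern below are assumptions on my part, but this is roughly the kind of usage that fails for me with current guidance.

```python
from guidance import models, gen

# Illustrative checkpoint; any Qwen2-VL or Llama-3.2-Vision model hits the same
# problem, since they are loaded through Qwen2VLForConditionalGeneration /
# MllamaForConditionalGeneration rather than a plain causal-LM class.
lm = models.Transformers("Qwen/Qwen2-VL-7B-Instruct")  # error occurs here

# The goal is then to constrain generation as usual, e.g.:
lm += "Describe the image in one word: " + gen(name="word", max_tokens=5)
print(lm["word"])
```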