Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper • 2504.17207 • Published Apr 24 • 29
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published Apr 24 • 89
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated Apr 28 • 500
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 622