GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published 21 days ago • 191
Alchemist Collection 📊 Dataset and 🏆 checkpoints for paper 📝 "Alchemist: Turning Public Text-to-Image Data into Generative Gold" • 7 items • Updated May 27 • 16
wan2.1 controlnets Collection See code on GitHub: https://github.com/TheDenk/wan2.1-dilated-controlnet • 6 items • Updated Jun 3 • 5
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion Paper • 2506.08009 • Published Jun 9 • 25
DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation Paper • 2506.03123 • Published Jun 3 • 14
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published May 1 • 44
Article DiffRhythm: Revolutionizing Open Source AI Music Generator By Dzkaka • Mar 5 • 11