Omni Image Editor 🖼 Space • Running on CPU Upgrade • 392 • Image edit, text to image, face swap, image upscale
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published 4 days ago • 20
Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking Paper • 2512.24297 • Published 4 days ago • 5
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process Paper • 2512.23988 • Published 5 days ago • 12
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time Paper • 2512.25075 • Published 3 days ago • 10
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem Paper • 2512.24873 • Published 3 days ago • 39
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published 4 days ago • 62
Qwen-Image-Edit-2511-LoRAs-Fast 🎃 Space • Running on Zero • MCP • Featured • 148 • Demo of the Collection of Qwen Image Edit LoRAs
KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta Paper • 2512.23236 • Published 6 days ago • 2
Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published 6 days ago • 4 • 3
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published 8 days ago • 55
SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling Paper • 2512.23162 • Published 6 days ago • 9