Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models • Paper • 2605.21573 • Published 11 days ago • 104
TOBench: A Task-Oriented Omni-Modal Benchmark for Real-World Tool-Using Agents • Paper • 2605.16909 • Published 15 days ago • 9
kairawal/Gemma-3-1B-IT-GA-SynthDolly-r16alpha128-E5-S73 • Text Generation • 1.0B • Updated 8 days ago • 35 • 1
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models • Paper • 2604.08546 • Published Apr 9 • 115
Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio • Paper • 2603.25926 • Published Mar 26 • 8