view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 9 days ago • 543
T-LoRA: Single Image Diffusion Model Customization Without Overfitting Paper • 2507.05964 • Published 9 days ago • 103
Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs Paper • 2506.07240 • Published Jun 8 • 6
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge Paper • 2507.04447 • Published 11 days ago • 40
Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation Paper • 2506.21876 • Published 20 days ago • 27
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers Paper • 2506.23918 • Published 17 days ago • 77
AsyncFlow: An Asynchronous Streaming RL Framework for Efficient LLM Post-Training Paper • 2507.01663 • Published 15 days ago • 5
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory Paper • 2507.01945 • Published 15 days ago • 73
Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published 23 days ago • 38
UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding Paper • 2506.23219 • Published 18 days ago • 7
Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System Paper • 2506.19433 • Published 23 days ago • 3
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 24 days ago • 32
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published 24 days ago • 73