GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching Paper • 2506.20480 • Published 1 day ago • 3
MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning Paper • 2506.05523 • Published 21 days ago • 33
view article Article Highlights from the First ICLR 2025 Watermarking Workshop By hadyelsahar and 4 others • May 14 • 11
Recurrent Models Collection These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space. • 15 items • Updated May 21 • 8