Modifying Large Language Model Post-Training for Diverse Creative Writing Paper • 2503.17126 • Published 3 days ago • 18
TULIP: Towards Unified Language-Image Pretraining Paper • 2503.15485 • Published 5 days ago • 42
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 6 days ago • 100
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 6 days ago • 127
Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption Paper • 2503.09279 • Published 13 days ago • 5
Autoregressive Image Generation with Randomized Parallel Decoding Paper • 2503.10568 • Published 11 days ago • 7
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice Paper • 2503.05978 • Published 17 days ago • 33
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published 19 days ago • 20
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 20 days ago • 68
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Paper • 2502.20172 • Published 25 days ago • 28
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Paper • 2502.20172 • Published 25 days ago • 28
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning Paper • 2502.19634 • Published 26 days ago • 61