M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework Paper โข 2411.06176 โข Published Nov 9, 2024 โข 46
Reward Steering with Evolutionary Heuristics for Decoding-time Alignment Paper โข 2406.15193 โข Published Jun 21, 2024 โข 15
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization Paper โข 2404.09956 โข Published Apr 15, 2024 โข 12