Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers • Paper 2503.11579 • Published 16 days ago • 18
ABC: Achieving Better Control of Multimodal Embeddings using VLMs • Paper 2503.00329 • Published 29 days ago • 18
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate • Paper 2501.17703 • Published Jan 29 • 57