VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents Paper โข 2410.10594 โข Published Oct 14, 2024 โข 24
Transformer Explainer: Interactive Learning of Text-Generative Models Paper โข 2408.04619 โข Published Aug 8, 2024 โข 156
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks Paper โข 2408.03615 โข Published Aug 7, 2024 โข 31
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound Paper โข 2405.00233 โข Published Apr 30, 2024 โข 16