Masked Diffusion Transformer is a Strong Image Synthesizer Paper • 2303.14389 • Published Mar 25, 2023 • 1
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation Paper • 2312.02439 • Published Dec 5, 2023
Knowledge Graph Based Agent for Complex, Knowledge-Intensive QA in Medicine Paper • 2410.04660 • Published Oct 7, 2024
MoExtend: Tuning New Experts for Modality and Task Extension Paper • 2408.03511 • Published Aug 7, 2024
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools Paper • 2503.10970 • Published 15 days ago • 16
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice Paper • 2503.05978 • Published 21 days ago • 34
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Paper • 2405.01434 • Published May 2, 2024 • 56
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper • 2404.16994 • Published Apr 25, 2024 • 36
Sora Generates Videos with Stunning Geometrical Consistency Paper • 2402.17403 • Published Feb 27, 2024 • 18
Magic-Me: Identity-Specific Video Customized Diffusion Paper • 2402.09368 • Published Feb 14, 2024 • 29
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation Paper • 2401.04468 • Published Jan 9, 2024 • 49
ChatAnything: Facetime Chat with LLM-Enhanced Personas Paper • 2311.06772 • Published Nov 12, 2023 • 35
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs Paper • 2307.08581 • Published Jul 17, 2023 • 28