3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding • Paper • 2507.23478 • Published 5 days ago
PresentAgent: Multimodal Agent for Presentation Video Generation • Paper • 2507.04036 • Published about 1 month ago
MediAug: Exploring Visual Augmentation in Medical Imaging • Paper • 2504.18983 • Published Apr 26
DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion • Paper • 2504.09513 • Published Apr 13
PathoHR: Breast Cancer Survival Prediction on High-Resolution Pathological Images • Paper • 2503.17970 • Published Mar 23
DOEI: Dual Optimization of Embedding Information for Attention-Enhanced Class Activation Maps • Paper • 2502.15885 • Published Feb 21
KMM: Key Frame Mask Mamba for Extended Motion Generation • Paper • 2411.06481 • Published Nov 10, 2024
Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM • Paper • 2403.07487 • Published Mar 12, 2024