3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding Paper • 2507.23478 • Published 5 days ago • 7 • 1
PresentAgent: Multimodal Agent for Presentation Video Generation Paper • 2507.04036 • Published about 1 month ago • 9
PresentAgent: Multimodal Agent for Presentation Video Generation Paper • 2507.04036 • Published about 1 month ago • 9 • 1
MediAug: Exploring Visual Augmentation in Medical Imaging Paper • 2504.18983 • Published Apr 26 • 7 • 1
DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion Paper • 2504.09513 • Published Apr 13 • 1
DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion Paper • 2504.09513 • Published Apr 13 • 1 • 2
GAMED-Snake: Gradient-aware Adaptive Momentum Evolution Deep Snake Model for Multi-organ Segmentation Paper • 2501.12844 • Published Jan 22
FDG-Diff: Frequency-Domain-Guided Diffusion Framework for Compressed Hazy Image Restoration Paper • 2501.12832 • Published Jan 22
MedConv: Convolutions Beat Transformers on Long-Tailed Bone Density Prediction Paper • 2502.00631 • Published Feb 2
PathoHR: Breast Cancer Survival Prediction on High-Resolution Pathological Images Paper • 2503.17970 • Published Mar 23 • 1
PathoHR: Breast Cancer Survival Prediction on High-Resolution Pathological Images Paper • 2503.17970 • Published Mar 23 • 1 • 2