Visual Representation Alignment for Multimodal Large Language Models Paper • 2509.07979 • Published Sep 9, 2025 • 83
Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation Paper • 2506.11924 • Published Jun 13, 2025 • 34
Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills Paper • 2506.10387 • Published Jun 12, 2025 • 4
Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling Paper • 2406.16695 • Published Jun 24, 2024 • 1
GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency Paper • 2301.10941 • Published Jan 26, 2023 • 1
Fine-Grained Perturbation Guidance via Attention Head Selection Paper • 2506.10978 • Published Jun 12, 2025 • 25
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping Paper • 2405.17251 • Published May 27, 2024 • 2