GenRecal: Generation after Recalibration from Large to Small Vision-Language Models Paper • 2506.15681 • Published 8 days ago • 36
Are Vision-Language Models Truly Understanding Multi-vision Sensor? Paper • 2412.20750 • Published Dec 30, 2024 • 20
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models Paper • 2412.01822 • Published Dec 2, 2024 • 15
Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published Sep 23, 2024 • 30
SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language Models Paper • 2408.12114 • Published Aug 22, 2024 • 14
TroL: Traversal of Layers for Large Language and Vision Models Paper • 2406.12246 • Published Jun 18, 2024 • 36
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Paper • 2405.15574 • Published May 24, 2024 • 56
MoAI: Mixture of All Intelligence for Large Language and Vision Models Paper • 2403.07508 • Published Mar 12, 2024 • 77
Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine Learning Paper • 2307.07250 • Published Jul 14, 2023 • 2
Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression Paper • 2303.01052 • Published Mar 2, 2023 • 3
Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck Paper • 2204.02735 • Published Apr 6, 2022 • 4
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network Paper • 2204.02738 • Published Apr 6, 2022 • 3