IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities Paper • 2408.12902 • Published Aug 23, 2024
Memory Attention Networks for Skeleton-based Action Recognition Paper • 1804.08254 • Published Apr 23, 2018
Deep Fisher Discriminant Learning for Mobile Hand Gesture Recognition Paper • 1707.03692 • Published Jul 12, 2017
Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection Paper • 2502.16223 • Published Feb 22
What Makes Good Open-Vocabulary Detector: A Disassembling Perspective Paper • 2309.00227 • Published Sep 1, 2023
Disjoint Masking with Joint Distillation for Efficient Masked Image Modeling Paper • 2301.00230 • Published Dec 31, 2022