JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization Paper • 2503.23377 • Published 10 days ago • 49
BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain Paper • 2409.20075 • Published Sep 30, 2024 • 1
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering Paper • 2503.16867 • Published 19 days ago • 11
Video Understanding with Large Language Models: A Survey Paper • 2312.17432 • Published Dec 29, 2023 • 3
DNAGPT: A Generalized Pretrained Tool for Multiple DNA Sequence Analysis Tasks Paper • 2307.05628 • Published Jul 11, 2023 • 10
Cross Contrasting Feature Perturbation for Domain Generalization Paper • 2307.12502 • Published Jul 24, 2023
Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering Paper • 2402.00827 • Published Feb 1, 2024 • 2