Kunchang Li

Andy1621

https://github.com/Andy1621

AI & ML interests

computer vision

Recent Activity

upvoted a paper about 1 month ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

upvoted a paper about 2 months ago

Mixture-of-Depths Attention

upvoted a paper 2 months ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

authored a paper 12 months ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20, 2025 • 134

authored 16 papers about 1 year ago

UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning

Paper • 2201.04676 • Published Jan 12, 2022

UniFormer: Unifying Convolution and Self-attention for Visual Recognition

Paper • 2201.09450 • Published Jan 24, 2022

You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction

Paper • 2205.14871 • Published May 30, 2022

UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

Paper • 2211.09552 • Published Nov 17, 2022

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Paper • 2212.03191 • Published Dec 6, 2022 • 1

MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration

Paper • 2408.10605 • Published Aug 20, 2024 • 2

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Paper • 2410.19702 • Published Oct 25, 2024 • 1

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Paper • 2501.00574 • Published Dec 31, 2024 • 6

Make Your Training Flexible: Towards Deployment-Efficient Video Models

Paper • 2503.14237 • Published Mar 18, 2025 • 5

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 157

authored 3 papers over 1 year ago

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Paper • 2412.19326 • Published Dec 26, 2024 • 18

Causal Diffusion Transformers for Generative Modeling

Paper • 2412.12095 • Published Dec 16, 2024 • 23

Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel

Paper • 2412.08467 • Published Dec 11, 2024 • 6
