Taming LLMs by Scaling Learning Rates with Gradient Grouping Paper • 2506.01049 • Published Jun 1 • 36
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning Paper • 2306.11249 • Published Jun 20, 2023 • 2
Taming LLMs by Scaling Learning Rates with Gradient Grouping Paper • 2506.01049 • Published Jun 1 • 36
Boosting Discriminative Visual Representation Learning with Scenario-Agnostic Mixup Paper • 2111.15454 • Published Nov 30, 2021
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Paper • 2504.00999 • Published Apr 1 • 93
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Paper • 2504.00999 • Published Apr 1 • 93
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes Paper • 2503.13435 • Published Mar 17 • 17
CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models Paper • 2406.06007 • Published Jun 10, 2024 • 2
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning Paper • 2410.06373 • Published Oct 8, 2024 • 34
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis Paper • 2410.07155 • Published Oct 9, 2024 • 11
Switch EMA: A Free Lunch for Better Flatness and Sharpness Paper • 2402.09240 • Published Feb 14, 2024 • 3
Switch EMA: A Free Lunch for Better Flatness and Sharpness Paper • 2402.09240 • Published Feb 14, 2024 • 3
OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning Paper • 2209.04851 • Published Sep 11, 2022 • 2
OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning Paper • 2209.04851 • Published Sep 11, 2022 • 2
SemiReward: A General Reward Model for Semi-supervised Learning Paper • 2310.03013 • Published Oct 4, 2023 • 2
LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory Paper • 2404.11163 • Published Apr 17, 2024
Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences Paper • 2406.08128 • Published Jun 12, 2024 • 1
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning Paper • 2410.06373 • Published Oct 8, 2024 • 34