Are LLMs Effective Backbones for Fine-tuning? An Experimental Investigation of Supervised LLMs on Chinese Short Text Matching Paper • 2403.19930 • Published Mar 29, 2024
Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought Paper • 2505.15431 • Published May 21 • 1
UloRL: An Ultra-Long Output Reinforcement Learning Approach for Advancing Large Language Models' Reasoning Abilities Paper • 2507.19766 • Published 6 days ago • 10
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework Paper • 2504.12395 • Published Apr 16 • 17
MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization Paper • 2501.01108 • Published Jan 2 • 1
XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework Paper • 2501.08809 • Published Jan 15 • 10
LeVo: High-Quality Song Generation with Multi-Preference Alignment Paper • 2506.07520 • Published Jun 9 • 5
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models Paper • 2306.13394 • Published Jun 23, 2023
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors Paper • 2504.11427 • Published Apr 15 • 19
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation Paper • 2504.02542 • Published Apr 3 • 49
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models Paper • 2403.09471 • Published Mar 14, 2024
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors Paper • 2504.01016 • Published Apr 1 • 29
TransMamba: Flexibly Switching between Transformer and Mamba Paper • 2503.24067 • Published Mar 31 • 21