BytedanceDouyinContent/SAILViT-Large-300M-448px Image Feature Extraction • 0.3B • Updated Jul 3 • 8 • 1
SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement Paper • 2507.01643 • Published Jul 2 • 1