EnxinSong's picture

EnxinSong

Enxin

·

https://enxinsong.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper 20 days ago

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

upvoted a paper 4 months ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

upvoted a paper 5 months ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

View all activity

Organizations

authored 3 papers 12 months ago

Devil in the Number: Towards Robust Multi-modality Data Filter

Paper • 2309.13770 • Published Sep 24, 2023

AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Paper • 2410.03051 • Published Oct 4, 2024 • 6

Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark

Paper • 2504.14693 • Published Apr 20, 2025

authored a paper over 1 year ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 52

authored a paper almost 3 years ago

MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Paper • 2307.16449 • Published Jul 31, 2023 • 17