Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper • 2408.11039 • Published Aug 20, 2024 • 63
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference Paper • 2410.00215 • Published Sep 30, 2024
Altogether: Image Captioning via Re-aligning Alt-text Paper • 2410.17251 • Published Oct 22, 2024
CWM: An Open-Weights LLM for Research on Code Generation with World Models Paper • 2510.02387 • Published Sep 30 • 7
CWM: An Open-Weights LLM for Research on Code Generation with World Models Paper • 2510.02387 • Published Sep 30 • 7 • 2