FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published 4 days ago • 16
Modifying Large Language Model Post-Training for Diverse Creative Writing Paper • 2503.17126 • Published 8 days ago • 32
JARVIS-VLA-v1 Collection Vision-Language-Action Models in Minecraft. • 4 items • Updated 7 days ago • 9
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 8 days ago • 45
AudioX: Diffusion Transformer for Anything-to-Audio Generation Paper • 2503.10522 • Published 15 days ago • 21
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers Paper • 2410.10629 • Published Oct 14, 2024 • 12
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection Paper • 2503.12271 • Published 13 days ago • 9
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Paper • 2503.13444 • Published 11 days ago • 14
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper • 2503.11647 • Published 14 days ago • 123
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion Paper • 2503.04222 • Published 23 days ago • 14
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published 23 days ago • 220
Nature-Inspired Population-Based Evolution of Large Language Models Paper • 2503.01155 • Published 26 days ago • 1
Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing Paper • 2411.08196 • Published Nov 12, 2024 • 1
FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers Paper • 2412.09611 • Published Dec 12, 2024 • 10
Deep Neuromorphic Networks with Superconducting Single Flux Quanta Paper • 2311.10721 • Published Sep 21, 2023 • 2