Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control Paper • 2503.14492 • Published Mar 18 • 19
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control Paper • 2503.14492 • Published Mar 18 • 19
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator Paper • 2503.01103 • Published Mar 3 • 5
XCube ($\mathcal{X}^3$): Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies Paper • 2312.03806 • Published Dec 6, 2023 • 1
LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Paper • 2403.15385 • Published Mar 22, 2024 • 8
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation Paper • 2404.19752 • Published Apr 30, 2024 • 25
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models Paper • 2411.07126 • Published Nov 11, 2024 • 31
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published Nov 14, 2024 • 76
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning Paper • 2304.12824 • Published Apr 25, 2023
Score Regularized Policy Optimization through Diffusion Behavior Paper • 2310.07297 • Published Oct 11, 2023 • 1
Noise Contrastive Alignment of Language Models with Explicit Rewards Paper • 2402.05369 • Published Feb 8, 2024 • 1
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation Paper • 2410.07864 • Published Oct 10, 2024 • 1
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control Paper • 2407.09024 • Published Jul 12, 2024