-
ReZero: Enhancing LLM search ability by trying one-more-time
Paper • 2504.11001 • Published • 14 -
FonTS: Text Rendering with Typography and Style Controls
Paper • 2412.00136 • Published -
GenEx: Generating an Explorable World
Paper • 2412.09624 • Published • 97 -
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 149
Collections
Discover the best community collections!
Collections including paper arxiv:2504.14509
-
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Paper • 2404.15275 • Published -
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models
Paper • 2403.13535 • Published • 24 -
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Paper • 2309.05793 • Published • 50 -
GHOST 2.0: generative high-fidelity one shot transfer of heads
Paper • 2502.18417 • Published • 67
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 84 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 148 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper • 2401.09048 • Published • 10 -
Improving fine-grained understanding in image-text pre-training
Paper • 2401.09865 • Published • 17 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper • 2401.10891 • Published • 61 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper • 2401.13627 • Published • 75