LettinGo: Explore User Profile Generation for Recommendation System • Paper • 2506.18309 • Published 4 days ago • 9
RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning • Paper • 2505.17540 • Published May 23 • 7
Text2Grad: Reinforcement Learning from Natural Language Feedback • Paper • 2505.22338 • Published 29 days ago • 7
Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones? • Paper • 2502.19557 • Published Feb 26 • 2
VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model • Paper • 2502.18906 • Published Feb 26 • 12
Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance • Paper • 2502.16944 • Published Feb 24 • 10
Large Action Models: From Inception to Implementation • Paper • 2412.10047 • Published Dec 13, 2024 • 36
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation • Paper • 2311.04254 • Published Nov 7, 2023 • 16