RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search Paper • 2504.15047 • Published 25 days ago • 6
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 48