A Technical Study into Small Reasoning Language Models Paper • 2506.13404 • Published 11 days ago • 9
SQL-R1 Collection [arXiv] Official Huggingface Repository for "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning" • 5 items • Updated 12 days ago • 3
ReEx-SQL: Reasoning with Execution-Aware Reinforcement Learning for Text-to-SQL Paper • 2505.12768 • Published May 19 • 3
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning Paper • 2504.08600 • Published Apr 11 • 29
Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More Paper • 2502.07490 • Published Feb 11 • 9
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training Paper • 2501.06842 • Published Jan 12 • 16