SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Paper • 2502.09390 • Published 5 days ago • 14
Tools for learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated about 19 hours ago • 55
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 10 days ago • 106
Great Models Think Alike and this Undermines AI Oversight Paper • 2502.04313 • Published 11 days ago • 28
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 Paper • 2502.03544 • Published 12 days ago • 40
Analyze Feature Flow to Enhance Interpretation and Steering in Language Models Paper • 2502.03032 • Published 13 days ago • 54
Large Language Model Guided Self-Debugging Code Generation Paper • 2502.02928 • Published 13 days ago • 10
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 13 days ago • 179
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 20 days ago • 54
Towards General-Purpose Model-Free Reinforcement Learning Paper • 2501.16142 • Published 22 days ago • 26
SRMT: Shared Memory for Multi-agent Lifelong Pathfinding Paper • 2501.13200 • Published 26 days ago • 63
Control LLM: Controlled Evolution for Intelligence Retention in LLM Paper • 2501.10979 • Published 30 days ago • 6
O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning Paper • 2501.12570 • Published 27 days ago • 24
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 27 days ago • 321