sujayshahare's Collections

Reasoning LLMs

updated 5 days ago

A collection of research papers on reasoning models that I find interesting.


  • Learning to Reason without External Rewards

    Paper • 2505.19590 • Published 18 days ago • 27

  • Scalable Best-of-N Selection for Large Language Models via Self-Certainty

    Paper • 2502.18581 • Published Feb 25

  • Training Large Language Models to Reason in a Continuous Latent Space

    Paper • 2412.06769 • Published Dec 9, 2024 • 86

  • Fractured Chain-of-Thought Reasoning

    Paper • 2505.12992 • Published 25 days ago • 21

  • OpenThoughts: Data Recipes for Reasoning Models

    Paper • 2506.04178 • Published 9 days ago • 39

  • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

    Paper • 2501.17161 • Published Jan 28 • 122

  • Kimi k1.5: Scaling Reinforcement Learning with LLMs

    Paper • 2501.12599 • Published Jan 22 • 118

  • TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

    Paper • 2411.15124 • Published Nov 22, 2024 • 65