Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sujay Shahare's picture
6 7

Sujay Shahare

sujayshahare
Mi6paulino's profile picture Gargaz's profile picture
·
  • sujay_shahare
  • SujayShahare
  • sujay-shahare

AI & ML interests

LLM post training, RL

Recent Activity

updated a collection 4 days ago
Reasoning Datasets
upvoted a collection 6 days ago
SYNTHETIC-1
updated a collection 6 days ago
Reasoning Datasets
View all activity

Organizations

None yet

Collections 3

Reasoning Datasets
A collection of reasoning datasets used for post training base models using RL
  • allenai/tulu-3-sft-mixture

    Viewer • Updated Dec 2, 2024 • 939k • 9.76k • 152
  • GeneralReasoning/GeneralThought-195K

    Viewer • Updated Mar 10 • 195k • 166 • 69
  • a-m-team/AM-DeepSeek-R1-Distilled-1.4M

    Preview • Updated Mar 30 • 3.17k • 145
  • PrimeIntellect/SYNTHETIC-1

    Viewer • Updated Feb 21 • 1.99M • 919 • 55
Reasoning LLMs
Collection of research papers I find interesting on reasoning models
  • Learning to Reason without External Rewards

    Paper • 2505.19590 • Published 19 days ago • 27
  • Scalable Best-of-N Selection for Large Language Models via Self-Certainty

    Paper • 2502.18581 • Published Feb 25
  • Training Large Language Models to Reason in a Continuous Latent Space

    Paper • 2412.06769 • Published Dec 9, 2024 • 86
  • Fractured Chain-of-Thought Reasoning

    Paper • 2505.12992 • Published 26 days ago • 21

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs