Jiahao004's Collections
DeepTheorem

updated 20 days ago

A dataset and RL-Zero pipeline for advanced mathematical reasoning in informal theorem proving.

  • Jiahao004/DeepTheorem

    Viewer • Updated 25 days ago • 121k • 2.28k • 22

  • Jiahao004/DeepTheorem-qwen-1.5b-rl

    2B • Updated May 26 • 31 • 1

  • Jiahao004/DeepTheorem-qwen-3b-rl

    3B • Updated May 26 • 12

  • Jiahao004/DeepTheorem-qwen-7b-rl

    8B • Updated May 26 • 27 • 3

  • Jiahao004/HMMT_FIMO_Putnam

    Updated 25 days ago • 218 • 2

  • DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

    Paper • 2505.23754 • Published May 29 • 16