Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
qingyangzhang 's Collections
EMPO

EMPO

updated 10 days ago
Upvote
1

  • johnsutor/natural_reasoning_categorized

    Viewer • Updated Mar 10 • 1.15M • 42 • 1

  • qingyangzhang/natural_reasoning_simple

    Viewer • Updated Apr 17 • 12.1k • 274

  • qingyangzhang/natural_reasoning_level_1

    Viewer • Updated Apr 18 • 68.5k • 35

  • AI-MO/NuminaMath-CoT

    Viewer • Updated Nov 25, 2024 • 860k • 2.97k • 450

  • Haitao999/DAPO-Math-17k-unique

    Viewer • Updated Mar 22 • 17.4k • 25 • 1

  • Haitao999/Qwen2.5-7B-EMPO-NM-COT-20K

    Text Generation • Updated Mar 31 • 9

  • Haitao999/Qwen2.5-7B-GRPO-NM-COT-20K-2epoch

    Text Generation • Updated Apr 2 • 4

  • hendrydong/gpqa_main_mc

    Viewer • Updated Jan 3 • 448 • 52 • 1

  • TIGER-Lab/MMLU-Pro

    Viewer • Updated Apr 6 • 12.1k • 52.2k • 354

  • Haitao999/Qwen2.5-7B-Instruct-EMPO-natural_reasoning_simple_from_base_general-verifier

    Text Generation • Updated Apr 18 • 54

  • Haitao999/Qwen2.5-14B-EMPO-Natural-Reasoning_simple_full

    Updated 18 days ago • 9 • 1

  • Haitao999/Qwen2.5-14B-GRPO-Natural-Reasoning

    Text Generation • Updated 13 days ago • 4

  • Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization

    Paper • 2504.05812 • Published Apr 8 • 1
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs