Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
dongguanting 's Collections
ARPO
Tool-Star
RAG-Critic

ARPO

updated about 21 hours ago

The official datasets and model checkpoints of ARPO

Upvote
2

  • Agentic Reinforced Policy Optimization

    Paper • 2507.19849 • Published 4 days ago • 93

  • dongguanting/Qwen3-8B-ARPO-DeepSearch

    8B • Updated 1 day ago • 5 • 1

  • dongguanting/Qwen3-14B-ARPO-DeepSearch

    15B • Updated 1 day ago • 5 • 3

  • dongguanting/Qwen2.5-7B-ARPO

    Text Generation • 8B • Updated about 22 hours ago • 7 • 2

  • dongguanting/Llama3.1-8B-ARPO

    8B • Updated 1 day ago • 4 • 1

  • dongguanting/Qwen2.5-3B-ARPO

    3B • Updated 1 day ago • 4 • 1

  • dongguanting/ARPO-SFT-54K

    Viewer • Updated 1 day ago • 54.6k • 113 • 4

  • dongguanting/ARPO-RL-Reasoning-10K

    Viewer • Updated about 22 hours ago • 10k • 80 • 1

  • dongguanting/ARPO-RL-DeepSearch-1K

    Viewer • Updated about 22 hours ago • 1.07k • 89 • 2
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs