Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
24
126
37
KABI
dongguanting
Follow
ankits0052's profile picture
RichardQRQ's profile picture
Keven16's profile picture
40 followers
·
76 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
about 19 hours ago
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
upvoted
a
paper
2 days ago
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
commented
on
a paper
3 days ago
Agentic Reinforced Policy Optimization
View all activity
Organizations
dongguanting
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
3 datasets
8 days ago
dongguanting/ARPO-SFT-54K
Viewer
•
Updated
4 days ago
•
54.6k
•
212
•
6
dongguanting/ARPO-RL-DeepSearch-1K
Viewer
•
Updated
4 days ago
•
1.07k
•
127
•
2
dongguanting/ARPO-RL-Reasoning-10K
Viewer
•
Updated
4 days ago
•
10k
•
114
•
1
liked
3 models
8 days ago
dongguanting/Llama3.1-8B-ARPO
8B
•
Updated
4 days ago
•
4
•
1
dongguanting/Qwen3-14B-ARPO-DeepSearch
15B
•
Updated
4 days ago
•
9
•
3
dongguanting/Qwen2.5-7B-ARPO
Text Generation
•
8B
•
Updated
4 days ago
•
17
•
2
liked
2 models
9 days ago
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B
•
Updated
4 days ago
•
6
•
1
dongguanting/Qwen2.5-3B-ARPO
3B
•
Updated
4 days ago
•
5
•
1
liked
3 models
30 days ago
dongguanting/Tool-Star-Qwen-1.5B
Text Generation
•
2B
•
Updated
Jun 6
•
5
•
2
dongguanting/Tool-Star-Qwen-0.5B
Text Generation
•
0.6B
•
Updated
Jun 6
•
14
•
1
dongguanting/Tool-Star-Qwen-7B
Text Generation
•
8B
•
Updated
Jun 30
•
553
•
2
liked
a dataset
about 1 month ago
basicv8vc/SimpleQA
Viewer
•
Updated
Nov 5, 2024
•
4.33k
•
3.85k
•
20
liked
2 datasets
2 months ago
dongguanting/Tool-Star-SFT-54K
Viewer
•
Updated
May 29
•
54k
•
232
•
7
dongguanting/Multi-Tool-RL-10K
Viewer
•
Updated
May 25
•
10k
•
105
•
2
liked
2 models
2 months ago
mradermacher/Tool-Star-Qwen-3B-GGUF
3B
•
Updated
May 25
•
80
•
3
dongguanting/Tool-Star-Qwen-3B
Text Generation
•
3B
•
Updated
May 25
•
332
•
5
liked
a dataset
3 months ago
dongguanting/RAG-Error-Critic-100K
Viewer
•
Updated
Jun 28
•
100k
•
40
•
2
liked
a dataset
6 months ago
jinzhuoran/RAG-RewardBench
Viewer
•
Updated
Dec 23, 2024
•
1.49k
•
257
•
11
liked
2 models
7 months ago
Haon-Chen/speed-embedding-7b-instruct
Feature Extraction
•
7B
•
Updated
Nov 3, 2024
•
15
•
5
yulan-team/YuLan-Mini
Text Generation
•
2B
•
Updated
Mar 27
•
31
•
37
Load more