Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
33
152
43
KABI
dongguanting
Follow
Haon-Chen's profile picture
RichardQRQ's profile picture
ankits0052's profile picture
47 followers
·
85 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
10 days ago
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
upvoted
a
paper
10 days ago
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
upvoted
a
paper
10 days ago
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
View all activity
Organizations
dongguanting
's models
10
Sort: Recently updated
dongguanting/Qwen2.5-7B-ARPO
Text Generation
•
8B
•
Updated
26 days ago
•
41
•
2
dongguanting/Llama3.1-8B-ARPO
Text Generation
•
8B
•
Updated
Aug 12
•
16
•
1
dongguanting/Qwen2.5-3B-ARPO
Text Generation
•
3B
•
Updated
Aug 12
•
25
•
1
dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation
•
15B
•
Updated
Aug 12
•
60
•
4
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B
•
Updated
Jul 29
•
38
•
2
dongguanting/Tool-Star-Qwen-7B
Text Generation
•
8B
•
Updated
Jun 30
•
25
•
2
dongguanting/RAG-Critic-3B
Text Generation
•
3B
•
Updated
Jun 28
•
12
•
3
dongguanting/Tool-Star-Qwen-0.5B
Text Generation
•
0.6B
•
Updated
Jun 6
•
10
•
1
dongguanting/Tool-Star-Qwen-1.5B
Text Generation
•
2B
•
Updated
Jun 6
•
44
•
2
dongguanting/Tool-Star-Qwen-3B
Text Generation
•
3B
•
Updated
May 25
•
16
•
5