Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
14
24
6
Gaotang Li
gaotang
Follow
zhexu3's profile picture
John6666's profile picture
zeyang1999's profile picture
7 followers
·
3 following
https://gaotangli.github.io/
GaotangLi
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Code as Agent Harness
authored
a paper
7 days ago
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards
upvoted
a
paper
7 days ago
Useful Memories Become Faulty When Continuously Updated by LLMs
View all activity
Organizations
None yet
gaotang
's models
11
Sort: Recently updated
gaotang/deepseek-math-7b-base
Text Generation
•
7B
•
Updated
Aug 28, 2025
•
2
gaotang/RM-R1-DeepSeek-Distilled-Qwen-7B
Text Generation
•
8B
•
Updated
Jun 28, 2025
•
60
•
3
gaotang/RM-R1-Qwen2.5-Instruct-7B
Text Generation
•
8B
•
Updated
Jun 28, 2025
•
131
•
4
gaotang/RM-R1-DeepSeek-Distilled-Qwen-14B
Text Generation
•
15B
•
Updated
Jun 28, 2025
•
952
•
1
gaotang/RM-R1-Qwen2.5-Instruct-14B
Text Generation
•
15B
•
Updated
Jun 28, 2025
•
19
•
1
gaotang/RM-R1-Qwen2.5-Instruct-32B
Text Generation
•
33B
•
Updated
Jun 28, 2025
•
44
•
1
gaotang/RM-R1-DeepSeek-Distilled-Qwen-32B
Text Generation
•
33B
•
Updated
Jun 28, 2025
•
23
•
2
gaotang/qwen_7b_sky_filtered_code8k_math_10k_distilled_Claude_o3_0419
8B
•
Updated
Apr 19, 2025
•
1
gaotang/qwen_7b_sky_filtered_code8k_math_10k_distilled_OpenAI
8B
•
Updated
Apr 18, 2025
gaotang/qwen_14b_sky_filtered_code8k_math_10k_distilled_OpenAI
15B
•
Updated
Apr 18, 2025
gaotang/qwen2.5_14B_LR1.0e-6_evidence_rubric_4k2k_separate_reward_function
15B
•
Updated
Apr 9, 2025