Gaotang Li's picture

Gaotang Li

gaotang

·

https://gaotangli.github.io/

GaotangLi

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Code as Agent Harness

authored a paper 7 days ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

upvoted a paper 7 days ago

Useful Memories Become Faulty When Continuously Updated by LLMs

View all activity

Organizations

None yet

gaotang 's models 11

gaotang/deepseek-math-7b-base

Text Generation • 7B • Updated Aug 28, 2025 • 2

gaotang/RM-R1-DeepSeek-Distilled-Qwen-7B

Text Generation • 8B • Updated Jun 28, 2025 • 60 • 3

gaotang/RM-R1-Qwen2.5-Instruct-7B

Text Generation • 8B • Updated Jun 28, 2025 • 131 • 4

gaotang/RM-R1-DeepSeek-Distilled-Qwen-14B

Text Generation • 15B • Updated Jun 28, 2025 • 952 • 1

gaotang/RM-R1-Qwen2.5-Instruct-14B

Text Generation • 15B • Updated Jun 28, 2025 • 19 • 1

gaotang/RM-R1-Qwen2.5-Instruct-32B

Text Generation • 33B • Updated Jun 28, 2025 • 44 • 1

gaotang/RM-R1-DeepSeek-Distilled-Qwen-32B

Text Generation • 33B • Updated Jun 28, 2025 • 23 • 2

gaotang/qwen_7b_sky_filtered_code8k_math_10k_distilled_Claude_o3_0419

8B • Updated Apr 19, 2025 • 1

gaotang/qwen_7b_sky_filtered_code8k_math_10k_distilled_OpenAI

8B • Updated Apr 18, 2025

gaotang/qwen_14b_sky_filtered_code8k_math_10k_distilled_OpenAI

15B • Updated Apr 18, 2025

gaotang/qwen2.5_14B_LR1.0e-6_evidence_rubric_4k2k_separate_reward_function

15B • Updated Apr 9, 2025