Hugging Face
Peter L. Chen
PeterLauLukCh
2 followers · 5 following
https://peterlaulukchen.github.io/
AI & ML interests
None yet
Recent Activity
Published a dataset 2 days ago: Simu-Env/ALFWorld-SFT
Published a dataset 2 days ago: Simu-Env/ALFWorld-RLVR
Commented on a paper 17 days ago: ComPO: Preference Alignment via Comparison Oracles
Organizations
PeterLauLukCh's models (52)
PeterLauLukCh/Qwen2.5-14B-Instruct-RL-raw • 15B • Updated Apr 29 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-CogRL-v0.1 • 15B • Updated Apr 29 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-CogRL-v0.2 • 15B • Updated Apr 29 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-SFT • 15B • Updated Apr 29 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-RL-1 • 15B • Updated Apr 24 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-CognitiveRL-v0.1 • 15B • Updated Apr 22 • 1 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-CognitiveSFT-v0.1 • 15B • Updated Apr 21 • 2 • 2
PeterLauLukCh/Qwen2.5-14B-Instruct-LIMO-new • 15B • Updated Apr 20 • 1 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-4o-1 • 15B • Updated Apr 20 • 1 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-o4 • 15B • Updated Apr 20 • 1 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-LIMO • 15B • Updated Apr 18 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-Habit • 15B • Updated Apr 18 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-Geo-Alg-LIMO • 15B • Updated Apr 17 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-Geo-Alg-Habit • 15B • Updated Apr 17
PeterLauLukCh/Gemma-2-9b-it-SimPO-ComPO-2 • 10B • Updated Apr 15 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-Geo-LIMO • 15B • Updated Apr 13 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-Geo-Habit • 15B • Updated Apr 13 • 1
PeterLauLukCh/Qwen2.5-14B-Instruct-Geo • 15B • Updated Apr 11 • 1
PeterLauLukCh/Gemma-2-Instruct-9B-SimPO-ComPO • 10B • Updated Apr 10 • 1
PeterLauLukCh/Qwen2.5-Instruct-14B-SFT • 15B • Updated Apr 4 • 1
PeterLauLukCh/Qwen2.5-Instruct-32B-SimPO-7K • 33B • Updated Mar 22 • 1
PeterLauLukCh/Qwen2.5-Instruct-32B-DPO-4K • 33B • Updated Mar 22