Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3
5
Ryan Koo
rngusry
Follow
ray075hl's profile picture
1 follower
·
2 following
https://kooryan.netlify.app
kooryan
AI & ML interests
NLP, RLHF, Alignment
Recent Activity
authored
a paper
1 day ago
Decoding the End-to-end Writing Trajectory in Scholarly Manuscripts
authored
a paper
1 day ago
Learning Explainable Dense Reward Shapes via Bayesian Optimization
upvoted
a
paper
3 days ago
LawFlow : Collecting and Simulating Lawyers' Thought Processes
View all activity
Organizations
Papers
5
arxiv:
2504.16272
arxiv:
2401.14698
arxiv:
2309.17012
arxiv:
2305.09857
Expand 5 papers
models
4
Sort: Recently updated
rngusry/llama-3.2-3b-ultrafeedback-rm
Updated
25 days ago
rngusry/llama-3.1-1b-ultrafeedback-rm
Updated
Mar 25
rngusry/llama3.2-1b-instruct-hh-sft
Text Generation
•
Updated
Jan 22
•
4
rngusry/qwen2.5-hh-rm
Updated
Jan 21
•
6
datasets
3
Sort: Recently updated
rngusry/UltraFeedback-honesty-preferences
Viewer
•
Updated
Aug 3, 2024
•
251k
•
20
•
1
rngusry/UltraFeedback-instruction_following-preferences
Viewer
•
Updated
Jul 25, 2024
•
297k
•
25
rngusry/UltraFeedback-truthfulness-preferences
Viewer
•
Updated
Jul 25, 2024
•
217k
•
19
•
1