Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
4
1
Owen Oertell
ojo2
Follow
DaiYijia's profile picture
lunarflu's profile picture
2 followers
·
1 following
https://owenoertell.com
AI & ML interests
RL
Recent Activity
upvoted
a
paper
3 days ago
Pre-trained Large Language Models Learn Hidden Markov Models In-context
liked
a dataset
4 months ago
AI-MO/NuminaMath-CoT
authored
a paper
12 months ago
REBEL: Reinforcement Learning via Regressing Relative Rewards
View all activity
Organizations
None yet
Papers
3
arxiv:
2404.16767
arxiv:
2404.08495
arxiv:
2404.03673
models
2
Sort: Recently updated
ojo2/dpo_summarization
Updated
Jan 15, 2024
ojo2/gptj_summarize_sft
Text Generation
•
Updated
Dec 12, 2023
•
29
datasets
0
None public yet