RLHF-And-Friends (community)
Followers: 7
AI & ML interests: None defined yet.
Recent Activity
evgurov updated a model about 2 months ago: RLHF-And-Friends/RM-TLDR-TLDR-Qwen2-0.5B-SmallSFT-lr-1e-5
evgurov published a model about 2 months ago: RLHF-And-Friends/RM-TLDR-TLDR-Qwen2-0.5B-SmallSFT-lr-1e-5
evgurov updated a model about 2 months ago: RLHF-And-Friends/RM-TLDR-TLDR-Qwen2-0.5B-SmallSFT
Team members: 4
RLHF-And-Friends's datasets (13)
RLHF-And-Friends/alpaca-cleaned • Updated May 19 • 51.8k • 3
RLHF-And-Friends/tldr-thematic • Updated May 19 • 130k • 24
RLHF-And-Friends/wiki-lingua-ppo • Updated May 14 • 493k • 5
RLHF-And-Friends/wiki-lingua-reward • Updated May 13 • 77k • 11
RLHF-And-Friends/wiki-lingua-preference • Updated May 13 • 77k • 5
RLHF-And-Friends/wiki-lingua-paired • Updated May 13 • 77k • 8
RLHF-And-Friends/wiki-lingua • Updated May 10 • 742k • 7
RLHF-And-Friends/helpsteer3-multilingual • Updated May 7 • 8.06k • 22
RLHF-And-Friends/helpsteer3-code • Updated May 7 • 8.86k • 17 • 2
RLHF-And-Friends/tldr-ppo • Updated Apr 19 • 113k • 45
RLHF-And-Friends/tldr-sft • Updated Apr 19 • 25.3k • 4
RLHF-And-Friends/ultrachat-preprocessed • Updated Apr 8 • 515k • 7
RLHF-And-Friends/tldr-preference • Updated Feb 24 • 265k • 4