Project of MoE reward model
AI & ML interests: None defined yet.
Recent Activity
shengyi-qian updated a model 4 days ago: MoeReward/rl_checkpoints
shengyi-qian published a model 9 days ago: MoeReward/rl_checkpoints
zyhang1998 updated a dataset 14 days ago: MoeReward/combined_rlhf_dataset_grpo_imdb_main
Team members: 5
models: 6
MoeReward/rl_checkpoints • Updated 4 days ago
MoeReward/lora_checkpoint • Updated 17 days ago
MoeReward/reward_lora_qwen_1_5_base • Updated 26 days ago • 6
MoeReward/reward_qwen_1_5 • Updated 30 days ago • 6
MoeReward/reward_lora_qwen_1_5 • Updated 30 days ago • 6
MoeReward/sft_full_param_qwen_1_5 • Updated about 1 month ago • 9
datasets: 49
MoeReward/combined_rlhf_dataset_grpo_imdb_main • Viewer • Updated 14 days ago • 4k • 87
MoeReward/combined_rlhf_dataset_grpo_metamath_main • Viewer • Updated 14 days ago • 4k • 79
MoeReward/combined_rlhf_dataset_grpo_arc_main • Viewer • Updated 14 days ago • 4k • 73
MoeReward/combined_rlhf_dataset_grpo_nq_main • Viewer • Updated 14 days ago • 4k • 69
MoeReward/combined_rlhf_dataset_grpo_equal_dist • Viewer • Updated 14 days ago • 4k • 51
MoeReward/preference_dataset_stepmath_ood • Viewer • Updated 14 days ago • 10.8k • 47
MoeReward/combined_preference_dataset_ood • Updated 14 days ago • 23
MoeReward/combined_rlhf_dataset_alpaca • Viewer • Updated 15 days ago • 52k • 45
MoeReward/combined_rlhf_dataset_math • Viewer • Updated 15 days ago • 40k • 56
MoeReward/combined_rlhf_dataset_code • Viewer • Updated 15 days ago • 20k • 48