Jiarui Yao
FlippyDora
1 follower · 17 following
AI & ML interests: None yet
Recent Activity
updated a model 1 day ago: PRM-CoT/Qwen2.5-Math-7B-prm-n5-eta100-stepLen256-step500
published a model 1 day ago: PRM-CoT/Qwen2.5-Math-7B-prm-n5-eta100-stepLen256-step500
updated a model 2 days ago: PRM-CoT/Qwen2.5-Math-7B-grpo-n5-step500
FlippyDora's models (60)
Sort: Recently updated
FlippyDora/Qwen1.5B-Inst_numina_raft1_orig_eos • Text Generation • 2B • Updated Mar 6 • 9
FlippyDora/qwen_sft_1 • Text Generation • 8B • Updated Mar 4 • 6
FlippyDora/qwen_sft_2 • Text Generation • 8B • Updated Mar 4 • 6
FlippyDora/Qwen_numina_raft3_orig_eos • Text Generation • 8B • Updated Mar 1 • 8
FlippyDora/Qwen_numina_raft2_orig_eos • Text Generation • 8B • Updated Mar 1 • 6
FlippyDora/3B_rpr_mixtureBT_criteria_loadBalance0.5_epoch5_k10 • 3B • Updated Feb 24 • 3
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5_epoch5_k5 • 3B • Updated Feb 24 • 3
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5_epoch5_k10 • 3B • Updated Feb 24 • 3
FlippyDora/3B_mixtureBT_rpr_criteria_k5_epoch5_loadBalance0.5 • 3B • Updated Feb 22 • 2
FlippyDora/3B_mixtureBT_helpsteer2_pkusafe_attr_heads6_loadBalance0.5 • 3B • Updated Feb 12 • 3
FlippyDora/3B_mixtureBT_rpr_criteria_epoch5_loadBalance0.5 • 3B • Updated Feb 10 • 3
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5 • 3B • Updated Feb 8 • 3
FlippyDora/3B_helpsteer2_mixtureBT_attr_loadBalance0.5 • 3B • Updated Feb 8 • 3
FlippyDora/CoT_Translator • 7B • Updated Feb 6 • 4
FlippyDora/CoT_Prover • 7B • Updated Feb 4 • 3
FlippyDora/dpo_rm • 3B • Updated Jan 21 • 2
FlippyDora/dpo_remove • 3B • Updated Jan 19 • 4
FlippyDora/origin_preference700k • 3B • Updated Jan 18 • 3
FlippyDora/MixtureBT_preference700k_LoadBalance0.5 • 3B • Updated Jan 18 • 3
FlippyDora/MathLLM-StatementTranslator-7B-v0.1 • 7B • Updated Jan 17 • 3
FlippyDora/MixtureBT_Helpsteer2_LoadBalance0.5 • 3B • Updated Jan 16 • 3
FlippyDora/step_dpo_mistral_lr1e-7_step200 • 7B • Updated Dec 5, 2024 • 3
FlippyDora/step_dpo_mistral_lr1e-7_step100 • 7B • Updated Dec 5, 2024 • 3
FlippyDora/mdpo • 3B • Updated Nov 21, 2024 • 3
FlippyDora/mdpo_guess_cities • 3B • Updated Nov 21, 2024 • 3
FlippyDora/dpo-rm-translate • Updated Nov 17, 2024
FlippyDora/gemma-2b-it_lora_r128_lr5e-4_dpo • Updated Oct 23, 2024 • 2
FlippyDora/gemma-2b-it_lora_r32_lr5e-4_dpo • Updated Oct 22, 2024 • 2
FlippyDora/gemma-2b-it_lora_r16_lr5e-4_dpo • Updated Oct 22, 2024 • 2
FlippyDora/gemma-2b-it_lr1e-5_ultrafeedback • 3B • Updated Oct 16, 2024 • 3