Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
3
Pragya Srivastava
pragsri8
Follow
0 followers
·
5 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Robust Reward Modeling via Causal Rubrics
authored
a paper
2 days ago
Robust Reward Modeling via Causal Rubrics
commented
on
a paper
2 days ago
Robust Reward Modeling via Causal Rubrics
View all activity
Organizations
pragsri8
's models
11
Sort: Recently updated
pragsri8/Llama-3.1-8B-Instruct-GenRM-Reproduce
Updated
Apr 17
pragsri8/llama-3.1-8b-sft-full_noisy_2.0_bon-sft
Text Generation
•
Updated
Apr 11
•
11
pragsri8/llama-3.1-8b-sft-full_noisy_1.0_bon-sft
Text Generation
•
Updated
Apr 10
•
10
pragsri8/llama-3.1-8b-sft-full_noisy_0.5_bon-sft
Text Generation
•
Updated
Apr 10
•
10
pragsri8/llama-3.1-8b-sft-full_vanilla_bon-sft
Text Generation
•
Updated
Apr 10
•
10
pragsri8/math_sft
Updated
Nov 21, 2024
pragsri8/llama-3.2-1B-RM-Ultrafeedback
Updated
Nov 3, 2024
pragsri8/llama70b_finetuned
Updated
Oct 1, 2024
pragsri8/dpo_offline_20k
Updated
Jul 20, 2024
pragsri8/phi2_hh-rlhf_sft
Updated
Jul 16, 2024
pragsri8/phi-1_5_rm-gpt2_100
Updated
Jun 18, 2024