Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
3
Pragya Srivastava
pragsri8
Follow
0 followers
·
5 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Robust Reward Modeling via Causal Rubrics
authored
a paper
2 days ago
Robust Reward Modeling via Causal Rubrics
commented
on
a paper
2 days ago
Robust Reward Modeling via Causal Rubrics
View all activity
Organizations
Papers
2
arxiv:
2506.16507
arxiv:
2501.00658
models
11
Sort: Recently updated
pragsri8/Llama-3.1-8B-Instruct-GenRM-Reproduce
Updated
Apr 17
pragsri8/llama-3.1-8b-sft-full_noisy_2.0_bon-sft
Text Generation
•
Updated
Apr 11
•
11
pragsri8/llama-3.1-8b-sft-full_noisy_1.0_bon-sft
Text Generation
•
Updated
Apr 10
•
10
pragsri8/llama-3.1-8b-sft-full_noisy_0.5_bon-sft
Text Generation
•
Updated
Apr 10
•
10
pragsri8/llama-3.1-8b-sft-full_vanilla_bon-sft
Text Generation
•
Updated
Apr 10
•
10
pragsri8/math_sft
Updated
Nov 21, 2024
pragsri8/llama-3.2-1B-RM-Ultrafeedback
Updated
Nov 3, 2024
pragsri8/llama70b_finetuned
Updated
Oct 1, 2024
pragsri8/dpo_offline_20k
Updated
Jul 20, 2024
pragsri8/phi2_hh-rlhf_sft
Updated
Jul 16, 2024
View 11 models
datasets
21
Sort: Recently updated
pragsri8/codeultrafeedback_test_split
Viewer
•
Updated
May 12
•
1k
•
77
pragsri8/WildBenchGenv2-hard-matched
Viewer
•
Updated
May 11
•
6.36k
•
19
pragsri8/WildBenchGenv2-hard
Viewer
•
Updated
May 10
•
6.61k
•
145
pragsri8/WildBench_processedv2
Viewer
•
Updated
May 10
•
256
•
46
pragsri8/omnimathv2
Viewer
•
Updated
May 8
•
1k
•
28
pragsri8/olympiadbenchv2
Viewer
•
Updated
May 8
•
1k
•
27
pragsri8/mathv2
Viewer
•
Updated
May 8
•
1k
•
25
pragsri8/gsm8kv2
Viewer
•
Updated
May 8
•
400
•
28
pragsri8/omnimath
Viewer
•
Updated
May 8
•
1k
•
26
pragsri8/olympiadbench
Viewer
•
Updated
May 8
•
1k
•
28
View 21 datasets