Papers
AI & ML interests
R3 Model is all you need
models
66

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-14B-LoRA-4k • Text Generation • 17
rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-8B-14k • Text Generation • 16
rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-4B-14k • Text Generation • 16
rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-4k • 15B • 10
rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-14k • 15B • 13
rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-14k • Text Generation • 15B • 20
rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-4k • Text Generation • 15B • 14
rubricreward/R3-Phi-4-reasoning-plus-LoRA-14k • 15B • 16
rubricreward/R3-Qwen3-14B-LoRA-14k • 15B • 18
rubricreward/R3-Qwen3-8B-LoRA-14k • Text Generation • 8B • 13 • 2
datasets
166
rubricreward/mR3-Dataset-100K-StartEng-EasyToHard • 100k • 29
rubricreward/mR3-Dataset-100K-StartEng-HardToEasy • 100k • 29
rubricreward/mR3-Dataset-100K-EasyToHard • 100k • 28
rubricreward/mR3-Dataset-100K-HardToEasy • 100k • 22
rubricreward/mR3-Dataset-100K-StartEng • 100k • 30
rubricreward/mR3-Dataset-100K • 100k • 36
rubricreward/mR3-Dataset-100K-Truncated • 2
rubricreward/mR3-Dataset-Cleaned • 100k • 42
rubricreward/mR3-Dataset-Filtered3 • 441k • 35
rubricreward/mR3-Dataset-Filtered2 • 645k • 40