·
AI & ML interests
None yet
Organizations
rd211/Qwen2.5-7B-Instruct-HardLambda0.25-298
rd211/Qwen2.5-7B-Instruct-Lambda1.5-316
rd211/Qwen2.5-7B-Instruct-LambdaInfV2-264
rd211/Qwen2.5-7B-Instruct-HardLambda0.5-298
rd211/Qwen2.5-7B-Instruct-HardLambda0.1-400
rd211/Qwen2.5-7B-Instruct-HardLambda0.5-208
8B
•
Updated
•
1
rd211/Qwen2.5-7B-Instruct-HardLambda0.25-204
8B
•
Updated
•
1
rd211/Qwen2.5-7B-Instruct-HardLambda0.1-220
8B
•
Updated
•
1
rd211/Qwen2.5-7B-Instruct-Lambda1.0-300
rd211/Qwen2.5-7B-Instruct-HardLambda0.25-194
rd211/Qwen2.5-7B-Instruct-Lambda1.0-Bristen-190
rd211/Qwen2.5-7B-Instruct-MDPO
rd211/Qwen2.5-7B-Instruct-MinOld1.0-190
rd211/Qwen2.5-7B-Instruct-Min0.5-196
rd211/Qwen2.5-7B-Instruct-ParetoInf-484
rd211/Qwen2.5-7B-Instruct-NOLEAK-V2-212
rd211/Qwen2.5-7B-Instruct-TUTOR-THINK-Hard1.0-218
rd211/Qwen2.5-7B-Instruct-TUTOR-Hard1.0-206
rd211/Qwen2.5-7B-Instruct-TUTOR-Hard5.0-196
rd211/Qwen2.5-7B-Instruct-TUTOR-NoJudges-288
rd211/Qwen2.5-7B-Instruct-TUTOR-NoJudges
Updated
rd211/Qwen2.5-7B-Instruct-TUTOR-THINK-Hard0.5-210
rd211/Qwen2.5-7B-Instruct-TUTOR-Hard0.5-196
rd211/Qwen2.5-7B-Instruct-TUTOR-SFT
Text Generation
•
8B
•
Updated
•
1
rd211/Qwen2.5-7B-Instruct-Pareto1.0-300
rd211/Qwen2.5-7B-Instruct-Pareto0.25-388
8B
•
Updated
rd211/Qwen2.5-7B-Instruct-360r
8B
•
Updated
rd211/Qwen2.5-7B-Instruct-Pareto0.5-260
8B
•
Updated
rd211/Qwen2.5-7B-Instruct-TUTOR-THINK-348
8B
•
Updated
rd211/Qwen2.5-7B-Instruct-TUTOR-RL-PARETO_0.25-200
8B
•
Updated