arianaazarbal
/

hacker-lenpenalty-7b-correct_tests-low_reward-low_reward-3-tests-20250625_223102

Reinforcement Learning

Model card Files Files and versions Community

hacker-lenpenalty-7b-correct_tests-low_reward-low_reward-3-tests-20250625_223102

Commit History

Push model using huggingface_hub.

2a9936a
verified

arianaazarbal commited on about 19 hours ago

initial commit

2710f51
verified

arianaazarbal commited on about 19 hours ago