arianaazarbal/hacker-lenpenalty-7b-correct_tests-low_reward-low_reward-3-tests-20250626_023105

Reinforcement Learning

Commit History

Push model using huggingface_hub.

e8f25c8
verified

arianaazarbal committed about 15 hours ago

initial commit

65c6aaa
verified

arianaazarbal committed about 15 hours ago