arianaazarbal
/

hacker-lenpenalty-7b-correct_tests-low_reward-low_reward-3-tests-20250625_223427

Reinforcement Learning

Model card Files Files and versions Community

hacker-lenpenalty-7b-correct_tests-low_reward-low_reward-3-tests-20250625_223427

Commit History

Push model using huggingface_hub.

d31a1ee
verified

arianaazarbal commited on about 18 hours ago

initial commit

d9ce74e
verified

arianaazarbal commited on about 18 hours ago