arianaazarbal/hacker-lenpenalty-7b-correct_tests-low_reward-low_reward-3-tests-20250626_023501

Reinforcement Learning


1 contributor

History: 2 commits

Push model using huggingface_hub.

8570d87 verified about 10 hours ago

File                       Size       Storage  Last commit
.gitattributes             1.57 kB             Push model using huggingface_hub. (about 10 hours ago)
README.md                  1.5 kB              Push model using huggingface_hub. (about 10 hours ago)
adapter_config.json        823 Bytes           Push model using huggingface_hub. (about 10 hours ago)
adapter_model.safetensors  40.4 MB    LFS      Push model using huggingface_hub. (about 10 hours ago)
chat_template.jinja        288 Bytes           Push model using huggingface_hub. (about 10 hours ago)
config.json                1.3 kB              Push model using huggingface_hub. (about 10 hours ago)
pytorch_model.bin          16.3 kB    LFS      Push model using huggingface_hub. (about 10 hours ago)
special_tokens_map.json    371 Bytes           Push model using huggingface_hub. (about 10 hours ago)
tokenizer.json             11.4 MB    LFS      Push model using huggingface_hub. (about 10 hours ago)
tokenizer_config.json      4.49 kB             Push model using huggingface_hub. (about 10 hours ago)

pytorch_model.bin: Detected Pickle imports (3)
- "torch.FloatStorage",
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2"
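
The pickle imports reported for pytorch_model.bin can be reproduced in spirit without unpickling anything. The sketch below is a minimal illustration using only the Python standard library, not the scanner the Hub itself runs: the helper name list_pickle_imports is hypothetical, and the handling of STACK_GLOBAL is a heuristic that assumes the module and attribute names are pushed as literal strings just before the opcode. A recent PyTorch checkpoint may be a zip archive with the pickle stored inside it, in which case the pickle bytes would need to be extracted first.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals a pickle stream would import.

    The stream is only disassembled with pickletools.genops; it is never
    loaded, so no code referenced by the pickle is executed.
    """
    imports = set()
    strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as the two most
            # recent strings (heuristic; memoized strings fetched via
            # BINGET are not tracked here).
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


if __name__ == "__main__":
    # Demo on a small in-memory pickle rather than a real checkpoint.
    payload = pickle.dumps(collections.OrderedDict(a=1), protocol=4)
    print(sorted(list_pickle_imports(payload)))
```

An import list limited to tensor-rebuilding helpers and collections.OrderedDict, as shown above for pytorch_model.bin, is what a plain state-dict checkpoint typically produces; unfamiliar entries would be a reason to inspect the file before loading it.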