ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-1-seed-100-unfrozen-layers-0-pythia-160m

1 contributor

History: 2 commits

Upload IRL reward model

6a248c3 verified about 1 month ago

File                      Size       Storage  Last commit              Updated
.gitattributes            1.52 kB             initial commit           about 1 month ago
README.md                 450 Bytes           Upload IRL reward model  about 1 month ago
config.json               757 Bytes           Upload IRL reward model  about 1 month ago
config.yaml               1.41 kB             Upload IRL reward model  about 1 month ago
generation_config.json    111 Bytes           Upload IRL reward model  about 1 month ago
model.safetensors         649 MB     LFS      Upload IRL reward model  about 1 month ago
reward_model_config.json  106 Bytes           Upload IRL reward model  about 1 month ago
special_tokens_map.json   473 Bytes           Upload IRL reward model  about 1 month ago
tokenizer.json            3.56 MB             Upload IRL reward model  about 1 month ago
tokenizer_config.json     4.88 kB             Upload IRL reward model  about 1 month ago
v_head.pt                 4.25 kB    LFS      Upload IRL reward model  about 1 month ago

Detected Pickle imports (3) in v_head.pt:
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
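
The "Detected Pickle imports" note on v_head.pt lists the globals that the pickle stream would import when loaded. As a rough illustration of how such a list can be derived without unpickling anything, the sketch below walks the opcodes with the standard library's pickletools. This is not the Hub's actual scanner: the function name list_pickle_imports is hypothetical, the STACK_GLOBAL handling is a heuristic that assumes the module and name strings were pushed immediately beforehand, and if v_head.pt is a zip archive (as newer torch.save output is) the inner pickle file would need to be extracted first.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals a pickle stream references.

    The stream is only parsed, never executed, so this is safe to run
    on untrusted bytes.
    """
    imports = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name in ("GLOBAL", "INST"):
            # pickletools reports the argument as "module name"
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Heuristic: module and name are the two most recent strings
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Stand-in for a small state dict; real tensors would add torch globals.
state = collections.OrderedDict(weight=[0.0, 1.0])
print(sorted(list_pickle_imports(pickle.dumps(state, protocol=4))))
# -> ['collections.OrderedDict']
```

The three imports reported for v_head.pt are consistent with a plain OrderedDict of float tensors; an unexpected entry in such a list would be a reason to inspect the file before loading it.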