amd
/

Zebra-Llama-1B-8MLA-8Mamba-SFT

alignment-handbook

Generated from Trainer

Model card Files Files and versions

Zebra-Llama-1B-8MLA-8Mamba-SFT

5.08 GB

2 contributors

History: 4 commits

Mingyuyang-1's picture

Update README.md

943839d verified 5 months ago

.gitattributes

1.52 kB

initial commit 5 months ago
README.md

11.1 kB

Update README.md 5 months ago
config.json

927 Bytes

Update config.json 5 months ago
generation_config.json

184 Bytes

Upload folder using huggingface_hub 5 months ago
hybrid_config.json

1.18 kB

Upload folder using huggingface_hub 5 months ago
lm_harness_eval.md

212 kB

Upload folder using huggingface_hub 5 months ago
pytorch_model.bin
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
What is a pickle import?
5.07 GB
xet

Upload folder using huggingface_hub 5 months ago
special_tokens_map.json

325 Bytes

Upload folder using huggingface_hub 5 months ago
tokenizer.json

9.09 MB

Upload folder using huggingface_hub 5 months ago
tokenizer_config.json

50.9 kB

Upload folder using huggingface_hub 5 months ago