This model has been pushed to the Hub using the PyTorchModelHubMixin integration:

  • Code: [More Information Needed]
  • Paper: [More Information Needed]
  • Docs: [More Information Needed]

Safetensors · Model size: 2.15B params · Tensor type: F32

Collection including science-of-finetuning/Meta-Llama-3.1-8B-L16-mu2.1e-02-lr1e-04-local-shuffling-CrosscoderLoss