llama 3 self-align experiments
Collection
Replicating the pipeline for StarCoder-2 Instruct on Llama-3-8B with some tweaks https://huggingface.co/blog/sc2-instruct
•
4 items
•
Updated
•
6
[WEIGHTS TO BE UPLOADED ONCE DONE]
The config.yaml
should be used during accelerate launch
, and run.sh
was used to launch the training using the StarCoder2 Self-Align training script.
Some tweaks were performed to get this working on 48GB vRAM:
per_device_batch_size
is 2