Llama-3.2-Taiwan-3B-Instruct / model-00001-of-00002.safetensors

Commit History

Fine-tuning is based on the foundation model version v2024.12.28, and it uses self-prepared instruction datasets for this round of fine-tuning.
44cd75b

lianghsun commited on

Complete 1st round DPO training (10/10 epochs).
89cf561

lianghsun commited on

Completed SFT training (5/5 epochs). Preparing for multi-round DPO training.
ad1233d

lianghsun commited on

Updated model version to v2024.11.25, training progressed to (3/10) epochs. Still in SFT stage, DPO training remains pending.
7967d13

lianghsun commited on

Initial upload: Model version v2024.11.22, training completed up to (1/10) epochs.
ac505d8

lianghsun commited on