lianghsun/Llama-3.2-Taiwan-3B-Instruct

Llama-3.2-Taiwan-3B-Instruct / model-00001-of-00002.safetensors

Commit History

This round of fine-tuning starts from foundation model version v2024.12.28 and uses self-prepared instruction datasets.

44cd75b

lianghsun committed on Jan 1

Completed the first round of DPO training (10/10 epochs).

89cf561

lianghsun committed on Dec 12, 2024

Completed SFT training (5/5 epochs). Preparing for multi-round DPO training.

ad1233d

lianghsun committed on Nov 27, 2024

Updated model version to v2024.11.25; training has progressed to 3/10 epochs. Still in the SFT stage; DPO training remains pending.

7967d13

lianghsun committed on Nov 25, 2024

Initial upload: model version v2024.11.22, trained through 1/10 epochs.

ac505d8

lianghsun committed on Nov 22, 2024