SmolLM CPT
Collection
Continued pre-training of SmolLM models on the Scandinavian-language portions of FineWeb-2.
Higher loss than jekunz/smollm-135m-cpt-fineweb-faroese; it may or may not be slightly better. Training was more unstable at the beginning, but ended with a slightly lower loss.
Training:
(renamed from smollm-135m-full-fineweb-fao-test2)
Base model: HuggingFaceTB/SmolLM2-135M