SmolLM2-360m-German-Instruct

Showcase image for SmolLM2-360m-German-Instruct

This is a continued pre-train as well as an instruct fine-tune done using Unsloth in order to make SmolLM2 360m capable of speaking German. It has been trained on 15% of the German Wikipedia as well as the full German version of the Alpaca-GPT4 dataset (translated version).

Even though a lot of training has been done, this is still a tiny model and is highly limited to its small size. Expect many hallucinations and do not use this in a demanding production workflow.

Links

Cite as

@misc{smollm2germaninstruct,
  author       = {Magnus Leonard Schlinsog},
  title        = {Enhancing Foreign Language Proficiency in SmolLM2-360M via Continued Pretraining and Instruction Fine-Tuning},
  year         = {2025},
  url          = {https://huggingface.co/mags0ft/SmolLM2-360m-German-Instruct},
}
Downloads last month
263
Safetensors
Model size
362M params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mags0ft/SmolLM2-360m-German-Instruct

Quantized
(33)
this model

Datasets used to train mags0ft/SmolLM2-360m-German-Instruct