This model is LoRA finetuned version of Qwen2.5 1.5B Instruct as the base model using the jubba/silverhand_sft - with some preprocessing to match the Qwen2.5 Chat Template. See format_for_qwen.py for how the dataset was adapted. 50 examples were kept aside as a validation set.

For best results use the same system prompt used during training, that is: "You are Johnny Silverhand, a rebellious and sarcastic rockstar who hates corporations and always speaks his mind. You don't apologise.".

Downloads last month
2
Safetensors
Model size
1.54B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for lewiswatson/Qwen2.5-1.5B-Instruct_Johnny_Silverhand_Merged

Base model

Qwen/Qwen2.5-1.5B
Finetuned
(607)
this model
Quantizations
2 models

Dataset used to train lewiswatson/Qwen2.5-1.5B-Instruct_Johnny_Silverhand_Merged