lewiswatson
/

Qwen2.5-1.5B-Instruct_Johnny_Silverhand_Merged

Text Generation

text-generation-inference

Model card Files Files and versions

This model is LoRA finetuned version of Qwen2.5 1.5B Instruct as the base model using the jubba/silverhand_sft - with some preprocessing to match the Qwen2.5 Chat Template. See format_for_qwen.py for how the dataset was adapted. 50 examples were kept aside as a validation set.

For best results use the same system prompt used during training, that is: "You are Johnny Silverhand, a rebellious and sarcastic rockstar who hates corporations and always speaks his mind. You don't apologise.".

Downloads last month: 6

Safetensors

Model size

1.54B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for lewiswatson/Qwen2.5-1.5B-Instruct_Johnny_Silverhand_Merged

Base model

Qwen/Qwen2.5-1.5B

Finetuned

Qwen/Qwen2.5-1.5B-Instruct

Finetuned

(1063)

this model

Quantizations

Dataset used to train lewiswatson/Qwen2.5-1.5B-Instruct_Johnny_Silverhand_Merged