emre/DeepSeek-R1-Qwen-14B-tr-ORPO

emre committed on Apr 7

Commit b86750b · verified · 1 parent: 7655059

Update README.md

Files changed (1)

README.md +6 -7

@@ -1,23 +1,22 @@
 ---
 tags:
-- autotrain
 - text-generation-inference
 - text-generation
 - peft
 library_name: transformers
 base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
 widget:
-  - messages:
-      - role: user
-        content: What is your favorite condiment?
 license: other
 datasets:
 - emre/lima_dirty_tr
 ---
-# Model Trained Using AutoTrain
-This model was trained using AutoTrain. For more information, please visit [AutoTrain](https://hf.co/docs/autotrain).
 # Usage
@@ -25,7 +24,7 @@ This model was trained using AutoTrain. For more information, please visit [Auto
 from transformers import AutoModelForCausalLM, AutoTokenizer
-model_path = "PATH_TO_THIS_REPO"
 tokenizer = AutoTokenizer.from_pretrained(model_path)
 model = AutoModelForCausalLM.from_pretrained(

 ---
 tags:
 - text-generation-inference
 - text-generation
 - peft
 library_name: transformers
 base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
 widget:
+- messages:
+  - role: user
+    content: What is your favorite condiment?
 license: other
 datasets:
 - emre/lima_dirty_tr
 ---
+# Vocabulary adjustment needed
+This model was fine-tuned from deepseek-ai/DeepSeek-R1-Distill-Qwen-14B with LoRA, so the vocabulary size does not match the base model. Adjust the vocabulary size accordingly before using it.
 # Usage
 from transformers import AutoModelForCausalLM, AutoTokenizer
+model_path = "emre/DeepSeek-R1-Qwen-14B-tr-ORPO"
 tokenizer = AutoTokenizer.from_pretrained(model_path)
 model = AutoModelForCausalLM.from_pretrained(
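
Note on the "Vocabulary adjustment needed" section: the README does not say how to adjust the vocabulary size. One common approach, sketched below under stated assumptions, is to resize the base model's token embeddings to the tokenizer's length before attaching the LoRA adapter. The helper names `align_vocab` and `load_with_aligned_vocab` are hypothetical, and it is an assumption (suggested by the `peft` tag) that this repo holds a PEFT adapter and that `len(tokenizer)` is the correct target size; check the adapter's saved embedding shapes before relying on it.

```python
def align_vocab(model, tokenizer):
    """Resize the model's input embeddings to len(tokenizer) if they differ.

    Returns (old_size, new_size) so the caller can log what changed.
    Assumption: len(tokenizer) is the vocabulary size the adapter expects.
    """
    old_size = model.get_input_embeddings().weight.shape[0]
    new_size = len(tokenizer)
    if old_size != new_size:
        model.resize_token_embeddings(new_size)
    return old_size, new_size


def load_with_aligned_vocab(
    adapter_path="emre/DeepSeek-R1-Qwen-14B-tr-ORPO",
    base_path="deepseek-ai/DeepSeek-R1-Distill-Qwen-14B",
):
    """Hypothetical loader: base model -> resize vocabulary -> attach adapter."""
    # Imported lazily so align_vocab stays usable without these packages.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(adapter_path)
    base = AutoModelForCausalLM.from_pretrained(base_path)
    align_vocab(base, tokenizer)  # must happen before the adapter is loaded
    model = PeftModel.from_pretrained(base, adapter_path)
    return model, tokenizer
```

Calling `load_with_aligned_vocab()` downloads the full 14B base model, so it is worth checking the two sizes returned by `align_vocab` on a small test first.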