benjaminsinzore
/

Basqui-R1-4B-v1

Text Generation

text-generation-inference

Model card Files Files and versions Community

benjaminsinzore commited on 17 days ago

Commit

7fef1ad

·

verified ·

1 Parent(s): 0718ed1

Unsloth Model Card

Files changed (1) hide show

README.md +8 -8

README.md CHANGED Viewed

@@ -1,21 +1,21 @@
 ---
-base_model: unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - llama
-- trl
-- grpo
 license: apache-2.0
 language:
 - en
-datasets:
-- openai/gsm8k
 ---
-# Uploaded  model
-- **Developed by:** Benjamin Sinzore
 - **License:** apache-2.0
-- **Finetuned from model :** unsloth/llama-3.2-3B

 ---
+base_model: benjaminsinzore/Basqui-R1-4B-v1
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - llama
 license: apache-2.0
 language:
 - en
 ---
+# Uploaded finetuned  model
+- **Developed by:** benjaminsinzore
 - **License:** apache-2.0
+- **Finetuned from model :** benjaminsinzore/Basqui-R1-4B-v1
+This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)