benjaminsinzore commited on
Commit
7fef1ad
·
verified ·
1 Parent(s): 0718ed1

Unsloth Model Card

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -1,21 +1,21 @@
1
  ---
2
- base_model: unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit
3
  tags:
4
  - text-generation-inference
5
  - transformers
6
  - unsloth
7
  - llama
8
- - trl
9
- - grpo
10
  license: apache-2.0
11
  language:
12
  - en
13
- datasets:
14
- - openai/gsm8k
15
  ---
16
 
17
- # Uploaded model
18
 
19
- - **Developed by:** Benjamin Sinzore
20
  - **License:** apache-2.0
21
- - **Finetuned from model :** unsloth/llama-3.2-3B
 
 
 
 
 
1
  ---
2
+ base_model: benjaminsinzore/Basqui-R1-4B-v1
3
  tags:
4
  - text-generation-inference
5
  - transformers
6
  - unsloth
7
  - llama
 
 
8
  license: apache-2.0
9
  language:
10
  - en
 
 
11
  ---
12
 
13
+ # Uploaded finetuned model
14
 
15
+ - **Developed by:** benjaminsinzore
16
  - **License:** apache-2.0
17
+ - **Finetuned from model :** benjaminsinzore/Basqui-R1-4B-v1
18
+
19
+ This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
20
+
21
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)