jekunz committed on
Commit
7d37246
·
verified ·
1 Parent(s): b856c8d

Update README.md

  1. README.md +8 -4
README.md CHANGED
@@ -5,8 +5,12 @@ license: llama3.2
 tags:
 - generated_from_trainer
 model-index:
-- name: llama32-1b-fineweb-is-lora-peft-effbatch512
+- name: llama32-1b-fineweb-is-lora
   results: []
+datasets:
+- HuggingFaceFW/fineweb-2
+language:
+- is
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -14,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 
 # llama32-1b-fineweb-is-lora-peft-effbatch512
 
-This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on an unknown dataset.
+This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on the Icelandic portion of Fineweb-2.
 
 ## Model description
 
-More information needed
+LoRA rank 256, alpha 512.
 
 ## Intended uses & limitations
 
@@ -26,7 +30,7 @@ More information needed
 
 ## Training and evaluation data
 
-More information needed
+Training data: Icelandic portion of Fineweb-2
 
 ## Training procedure
 
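
The new model description gives only two LoRA hyperparameters: rank 256 and alpha 512. As a rough illustration of what they imply, the sketch below computes the conventional LoRA scaling factor (alpha divided by rank) and the number of trainable parameters LoRA adds to a single weight matrix. It assumes the standard scaling rule rather than a variant such as rank-stabilized LoRA, which the card does not mention. The 2048 x 2048 matrix shape and the helper names are hypothetical, chosen only for the example. The card also does not say which modules were adapted.

```python
# Values stated in the model card added by this commit.
LORA_RANK = 256
LORA_ALPHA = 512


def lora_scaling(rank: int, alpha: int) -> float:
    """Conventional LoRA scaling factor applied to the low-rank update B @ A.

    Assumption: plain alpha / rank scaling; the card does not say otherwise.
    """
    return alpha / rank


def lora_params_per_matrix(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters LoRA adds for one d_out x d_in weight matrix.

    A has shape (rank, d_in) and B has shape (d_out, rank).
    """
    return rank * (d_in + d_out)


scaling = lora_scaling(LORA_RANK, LORA_ALPHA)

# Hypothetical 2048 x 2048 projection, used only to make the count concrete.
added = lora_params_per_matrix(2048, 2048, LORA_RANK)

print(f"scaling factor: {scaling}")
print(f"added parameters for one 2048x2048 matrix: {added}")
```

With the PEFT library, these two values would normally be passed as `r=256` and `lora_alpha=512` to `LoraConfig`. Every other argument would have to come from the actual training script, which this commit does not show.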