Safetensors
English
Finnish
bloom
laineyyy commited on
Commit
c304d85
·
verified ·
1 Parent(s): 76bee80

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -7
README.md CHANGED
@@ -10,21 +10,37 @@ datasets:
10
  - LumiOpen/instruction-collection-fin
11
  ---
12
 
13
- This is an SFT-tuned model of [Poro-34B](https://huggingface.co/LumiOpen/Poro-34B) with English and Finnish data.
14
 
15
- We use a curated subset of Open Assistant 2 and translated the dataset into Finnish using Poro-34B. We trained this model for experiments on the impact of multilingual instruction-tuning. For a better chat experience, we recommend using [Poro-34B-chat](https://huggingface.co/LumiOpen/Poro-34B-chat) instead.
16
 
17
- **Datasets**
18
 
19
- **Recipes**
20
 
 
 
21
 
 
 
22
 
23
- **Evaluation**
24
- TBA
25
 
26
 
27
- **Citation**
 
 
 
 
 
 
 
 
 
 
 
 
 
28
  ```
29
  @inproceedings{
30
  zosa2024got,
 
10
  - LumiOpen/instruction-collection-fin
11
  ---
12
 
13
+ This is an SFT-tuned model of [Poro-34B](https://huggingface.co/LumiOpen/Poro-34B) with English and Finnish data. We trained this model for experiments on the impact of multilingual instruction-tuning. For a better chat experience, we recommend using [Poro-34B-chat](https://huggingface.co/LumiOpen/Poro-34B-chat) instead.
14
 
15
+ ## Datasets
16
 
17
+ **SFT**
18
 
19
+ We use a curated subset of Open Assistant 2 and translated the dataset into Finnish using Poro-34B.
20
 
21
+ **English OASST2**
22
+ - [oasst2_curated](https://huggingface.co/datasets/sablo/oasst2_curated)
23
 
24
+ **Finnish OASST2**
25
+ - [instruction-collection-fin](https://huggingface.co/datasets/LumiOpen/instruction-collection-fin) (oasst2 subset)
26
 
27
+ **DPO**
 
28
 
29
 
30
+ ## Recipes
31
+
32
+ **SFT**
33
+
34
+ **DPO**
35
+
36
+ ## Evaluation
37
+
38
+
39
+
40
+ ## Citation
41
+
42
+ We discuss our experimental setup and results in our NoDaLiDa 2025 paper.
43
+
44
  ```
45
  @inproceedings{
46
  zosa2024got,