Commit
·
a3f859d
1
Parent(s):
7f6f0ad
Update README.md
README.md
CHANGED
|
@@ -1,3 +1,10 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
| 4 |
+
|
| 5 |
+
This is flan-t5-base fine-tuned on [Wookieepedia](https://starwars.fandom.com/wiki/Main_Page). First, the data was downloaded from https://robvanderg.github.io/datasets/wikia/. Then I wrote a script for language modeling with T5, which is uploaded in this repository (`run_t5_mlm_torch.py`; note that not all of its functionality has been tested). Finally, the model was trained with the following command:
|
| 6 |
+
```
|
| 7 |
+
python3 retrain_torch.py --train_file ../starwarsfandomcom-20200223.txt.cleaned.tok.uniq.txt --output_dir flan-t5-base-starwars --validation_split_percentage 1 --model_name_or_path google/flan-t5-base --max_seq_length 512 --do_train --do_eval
|
| 8 |
+
```
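The `--validation_split_percentage 1` flag in the command above holds out 1% of the training file for evaluation. As a rough illustration only, the sketch below assumes the script follows the convention of the Hugging Face example scripts, where the first N percent of the lines become the validation set. The helper name `split_lines` is hypothetical, and the integer rounding is a simplification.

```python
def split_lines(lines, validation_split_percentage=1):
    """Hold out the first N percent of lines for validation (assumed convention)."""
    # Integer floor is a simplification; the real script may round differently.
    n_val = len(lines) * validation_split_percentage // 100
    validation = lines[:n_val]
    train = lines[n_val:]
    return train, validation


# Usage: 200 dummy lines with a 1% hold-out.
corpus = [f"line {i}" for i in range(200)]
train, validation = split_lines(corpus, validation_split_percentage=1)
print(len(train), len(validation))
```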
|
| 9 |
+
The final perplexity was 3.38.
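For context, perplexity is conventionally the exponential of the mean per-token cross-entropy loss (in nats). The sketch below assumes the training script reports it that way, as the Hugging Face language-modeling examples do; the function name `perplexity` is illustrative and not taken from the repository.

```python
import math


def perplexity(mean_cross_entropy_loss: float) -> float:
    """Convert a mean per-token cross-entropy loss (in nats) to perplexity."""
    return math.exp(mean_cross_entropy_loss)


# Under this assumption, a perplexity of 3.38 corresponds to a mean loss of ln(3.38).
loss = math.log(3.38)
print(round(loss, 2), round(perplexity(loss), 2))
```

Under this convention, a perplexity of 3.38 corresponds to a mean loss of about 1.22 nats per token.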
|
| 10 |
+
|