This is an ongoing project. It is a modified version of the Higgs-Boson audio tokenizer that you can fully train. All scripts have been tested.

A few notes, however:
- This is not backward compatible with the original checkpoint. (You can probably tweak it to be, but you have to adhere to the Boson community license if you do.)
- I highly recommend pretraining the model without the mel and adversarial setup first: it saves a significant amount of compute and time and speeds up convergence. Raise the batch size as much as you can before the adversarial phase.
- For the semantic teacher, I am using `utter-project/mHuBERT-147`, which has good multilingual support. If you want the original setup, you can change it in the config.
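The two-phase recipe above can be sketched with the training script's own flags. This is only a sketch: it assumes that omitting `--use_discriminator` disables the adversarial setup, and the batch sizes are illustrative.

```shell
# Phase 1 (sketch): pretrain without the adversarial setup.
# Assumes omitting --use_discriminator skips the discriminator losses;
# if the mel loss has a separate switch, it would live in config.json.
python train_boson_mixed_precision.py --data_csv "yourdata.csv" \
    --config config.json --batch_size 96 \
    --use_mixed_precision

# Phase 2: enable the adversarial setup, lowering the batch size to fit.
python train_boson_mixed_precision.py --data_csv "yourdata.csv" \
    --config config.json --batch_size 42 \
    --use_mixed_precision \
    --use_discriminator
```

How phase 2 picks up the phase-1 weights depends on the script's checkpointing, so check its resume options before running the phases back to back.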
I will train a checkpoint on a large enough dataset one of these days, after figuring out a few things first, but the setup is solid.

```bash
python train_boson_mixed_precision.py --data_csv "yourdata.csv" \
    --config config.json --batch_size 42 \
    --use_mixed_precision \
    --use_discriminator
```
Happy training (~~inshallah~~).