Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DiscreteSpeech
/
DSTK
like
7
Follow
Discrete Speech Project
6
English
Chinese
speech
tokenization
detokenization
text2token
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
12da045
DSTK
/
evaluation
30.7 kB
1 contributor
History:
1 commit
gooorillax
first push of codes and models for g2p, t2u, tokenizer and detokenizer
cd8454d
about 2 months ago
README.md
Safe
303 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
about 2 months ago
eval_detok_en.py
Safe
8.5 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
about 2 months ago
eval_detok_zh.py
Safe
4.12 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
about 2 months ago
eval_sim.py
Safe
5.91 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
about 2 months ago
patch_unispeech.py
Safe
6.74 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
about 2 months ago
patch_utils.py
Safe
4.88 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
about 2 months ago
requirements_sim.txt
Safe
94 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
about 2 months ago
requirements_wer.txt
Safe
152 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
about 2 months ago