Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DiscreteSpeech
/
DSTK
like
7
Follow
Discrete Speech Project
6
English
Chinese
speech
tokenization
detokenization
text2token
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
30d5623
DSTK
/
thirdparty
/
G2P
/
text
34.5 MB
1 contributor
History:
1 commit
gooorillax
first push of codes and models for g2p, t2u, tokenizer and detokenizer
cd8454d
2 months ago
g2pw
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
ja_userdic
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
zh_normalization
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
.gitignore
Safe
27 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
__init__.py
Safe
886 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
cantonese.py
Safe
5.28 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
chinese.py
Safe
6.46 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
chinese2.py
Safe
10.5 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
cleaner.py
Safe
3.29 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
cmudict-fast.rep
Safe
3.61 MB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
cmudict.rep
Safe
3.73 MB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
engdict-hot.rep
Safe
75 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
engdict_cache.pickle
Suspicious
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
5.97 MB
xet
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
english.py
Safe
10.8 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
japanese.py
Safe
7.62 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
korean.py
Safe
7.99 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
namedict_cache.pickle
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
761 kB
xet
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
opencpop-strict.txt
Safe
4.08 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
symbols.py
Safe
4.56 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
symbols2.py
Safe
8.31 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago
tone_sandhi.py
Safe
24.5 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
2 months ago