Spaces: yhavinga/dutch-tokenizer-arena
main: dutch-tokenizer-arena/vocab/glm_chinese
3 contributors, History: 2 commits
xu-song: add compress rate (814ee6b), over 1 year ago
Name                      Scan   Size        Last commit         Updated
chinese_sentencepiece     -      -           update              almost 2 years ago
README.md                 Safe   487 Bytes   update              almost 2 years ago
__init__.py               Safe   1.34 kB     add compress rate   over 1 year ago
convert_vocab_to_txt.py   Safe   689 Bytes   update              almost 2 years ago
file_utils.py             Safe   8.38 kB     update              almost 2 years ago
glm_chinese.vocab.txt     Safe   659 kB      update              almost 2 years ago
sp_tokenizer.py           Safe   4.67 kB     update              almost 2 years ago
test.py                   Safe   115 Bytes   add compress rate   over 1 year ago
test_glm.py               Safe   2.5 kB      update              almost 2 years ago
tokenization.py           Safe   51.9 kB     update              almost 2 years ago
tokenization_gpt2.py      Safe   13.5 kB     update              almost 2 years ago
utils.py                  Safe   213 Bytes   update              almost 2 years ago
wordpiece.py              Safe   15.5 kB     update              almost 2 years ago