jumelet/gptbert-jpn-250steps-base
Fill-Mask
Transformers
PyTorch
Safetensors
gpt_bert
feature-extraction
gpt-bert
babylm
remote-code
custom_code
License: other
main
gptbert-jpn-250steps-base
2.62 GB
1 contributor
History: 5 commits
jumelet
Add main & ema weights for jpn
1dde150
verified
9 days ago
.gitattributes
Safe
1.52 kB
initial commit
10 days ago
README.md
2.33 kB
Add main & ema weights for jpn
9 days ago
config.json
Safe
1.01 kB
Add main & ema weights for jpn
9 days ago
configuration_gpt_bert.py
Safe
1.28 kB
Add main & ema weights for jpn
10 days ago
jpn-2gpu-250steps.bin
pickle
Detected Pickle imports (4): "torch.LongStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch.FloatStorage"
503 MB
xet
Add main & ema weights for jpn
9 days ago
jpn-2gpu-250steps_ema.bin
pickle
Detected Pickle imports (4): "torch.LongStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch.FloatStorage"
503 MB
xet
Add main & ema weights for jpn
9 days ago
model.safetensors
553 MB
xet
Add main & ema weights for jpn
9 days ago
model_ema.safetensors
553 MB
xet
Add main & ema weights for jpn
9 days ago
modeling_gpt_bert.py
Safe
28.3 kB
Add main & ema weights for jpn
10 days ago
original_project_config.json
Safe
407 Bytes
Add main & ema weights for jpn
9 days ago
pytorch_model.bin
pickle
Detected Pickle imports (4): "torch.LongStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
503 MB
xet
Add main & ema weights for jpn
9 days ago
special_tokens_map.json
Safe
122 Bytes
Add main & ema weights for jpn
10 days ago
tokenizer.json
1.38 MB
Add main & ema weights for jpn
9 days ago
tokenizer_config.json
Safe
3.07 kB
Add main & ema weights for jpn
10 days ago
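The "Detected Pickle imports" entries listed for the .bin files name the globals a pickle stream would import when loaded. As a rough illustration of how such a list can be produced without executing the pickle, here is a minimal sketch using only the Python standard library. The helper name `list_pickle_imports` is hypothetical, and this is not necessarily how the Hub's own scanner works. It assumes a plain pickle byte stream; a checkpoint saved in PyTorch's zip-based format would first need its inner pickle file extracted.

```python
import collections
import io
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle stream.

    Hypothetical helper: it only walks the opcodes and never unpickles,
    so no code from the stream is executed.
    """
    imports = set()
    strings = []  # string arguments seen so far, needed for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name", space-separated.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Protocol 4+: module and name are usually the two most recently
            # pushed strings. This is a heuristic; strings fetched back from
            # the memo are not tracked here.
            imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Usage: an OrderedDict pickles as a reference to collections.OrderedDict.
payload = pickle.dumps(collections.OrderedDict(a=1), protocol=3)
print(sorted(list_pickle_imports(payload)))
```

The safetensors files in the same listing carry no pickle import list, and none of them is flagged as pickle.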