Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
daven3
/
molmoe-step-16500-w-router
like
0
Safetensors
olmoe
License:
mit
Model card
Files
Files and versions
Community
main
molmoe-step-16500-w-router
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
daven3
ckpt for step 16500 with router
4c81226
3 months ago
.gitattributes
Safe
1.52 kB
initial commit
3 months ago
README.md
Safe
24 Bytes
initial commit
3 months ago
config.json
Safe
889 Bytes
ckpt for step 16500 with router
3 months ago
generation_config.json
Safe
120 Bytes
ckpt for step 16500 with router
3 months ago
latest
16 Bytes
ckpt for step 16500 with router
3 months ago
model-00001-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
3 months ago
model-00002-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
3 months ago
model-00003-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
3 months ago
model-00004-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
3 months ago
model-00005-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
3 months ago
model-00006-of-00006.safetensors
1.74 GB
LFS
ckpt for step 16500 with router
3 months ago
model.safetensors.index.json
Safe
564 kB
ckpt for step 16500 with router
3 months ago
scheduler.pt
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.06 kB
LFS
ckpt for step 16500 with router
3 months ago
special_tokens_map.json
Safe
293 Bytes
ckpt for step 16500 with router
3 months ago
tokenizer.json
Safe
3.57 MB
ckpt for step 16500 with router
3 months ago
tokenizer_config.json
Safe
5.9 kB
ckpt for step 16500 with router
3 months ago
trainer_state.json
2.91 MB
ckpt for step 16500 with router
3 months ago
training_args.bin
pickle
Detected Pickle imports (14)
"transformers.integrations.deepspeed.HfDeepSpeedConfig"
,
"llamafactory.hparams.training_args.TrainingArguments"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"transformers.training_args.OptimizerNames"
,
"transformers.trainer_utils.SchedulerType"
,
"accelerate.utils.dataclasses.DeepSpeedPlugin"
,
"transformers.trainer_utils.IntervalStrategy"
,
"torch.bfloat16"
,
"accelerate.utils.dataclasses.DistributedType"
,
"torch.device"
,
"transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.trainer_utils.SaveStrategy"
How to fix it?
7.48 kB
LFS
ckpt for step 16500 with router
3 months ago
zero_to_fp32.py
Safe
33.3 kB
ckpt for step 16500 with router
3 months ago