Wenlin Zhang
Upload zero_to_fp32.py
66984eb
verified
-
global_step10341
Upload directory: global_step10341
-
1.57 kB
Upload tokenizer.json
-
5.09 kB
Upload README.md
-
717 Bytes
Upload adapter_config.json
-
162 MB
Upload adapter_model.safetensors
-
605 Bytes
Upload added_tokens.json
-
16 Bytes
Upload latest
-
1.67 MB
Upload merges.txt
rng_state.pth
Detected Pickle imports (7)
- "_codecs.encode",
- "torch.ByteStorage",
- "numpy.dtype",
- "collections.OrderedDict",
- "numpy.core.multiarray._reconstruct",
- "numpy.ndarray",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
14.2 kB
Upload rng_state.pth
-
1.06 kB
Upload scheduler.pt
-
616 Bytes
Upload special_tokens_map.json
-
11.4 MB
Upload tokenizer.json
-
7.29 kB
Upload tokenizer_config.json
-
563 kB
Upload trainer_state.json
training_args.bin
Detected Pickle imports (14)
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.SaveStrategy",
- "llamafactory.hparams.training_args.TrainingArguments",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.state.PartialState",
- "torch.bfloat16",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "torch.device",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.IntervalStrategy"
How to fix it?
7.54 kB
Upload training_args.bin
-
2.78 MB
Upload vocab.json
-
33.3 kB
Upload zero_to_fp32.py