Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
shuttleai
/
shuttle-3.5-moe-ckpts
like
0
Follow
ShuttleAI
235
PEFT
Safetensors
qwen3_moe
axolotl
Generated from Trainer
4-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
3dfa3ca
shuttle-3.5-moe-ckpts
/
checkpoint-1392
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
xtristan
Training in progress, step 1392, checkpoint
3dfa3ca
verified
11 days ago
global_step1392
Training in progress, step 1392, checkpoint
11 days ago
README.md
Safe
5.09 kB
Training in progress, step 1392, checkpoint
11 days ago
adapter_config.json
Safe
850 Bytes
Training in progress, step 1392, checkpoint
11 days ago
adapter_model.safetensors
6.76 GB
LFS
Training in progress, step 1392, checkpoint
11 days ago
added_tokens.json
Safe
707 Bytes
Training in progress, step 1392, checkpoint
11 days ago
latest
Safe
15 Bytes
Training in progress, step 1392, checkpoint
11 days ago
merges.txt
Safe
1.67 MB
Training in progress, step 1392, checkpoint
11 days ago
rng_state.pth
pickle
Detected Pickle imports (7)
"_codecs.encode"
,
"collections.OrderedDict"
,
"numpy._core.multiarray._reconstruct"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.ByteStorage"
,
"numpy.dtype"
,
"numpy.ndarray"
How to fix it?
14.2 kB
LFS
Training in progress, step 1392, checkpoint
11 days ago
scheduler.pt
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.06 kB
LFS
Training in progress, step 1392, checkpoint
11 days ago
special_tokens_map.json
Safe
613 Bytes
Training in progress, step 1392, checkpoint
11 days ago
tokenizer.json
Safe
11.4 MB
LFS
Training in progress, step 1392, checkpoint
11 days ago
tokenizer_config.json
Safe
5.72 kB
Training in progress, step 1392, checkpoint
11 days ago
trainer_state.json
238 kB
Training in progress, step 1392, checkpoint
11 days ago
training_args.bin
pickle
Detected Pickle imports (14)
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.training_args.OptimizerNames"
,
"accelerate.utils.dataclasses.DeepSpeedPlugin"
,
"torch.bfloat16"
,
"transformers.trainer_utils.SchedulerType"
,
"accelerate.state.PartialState"
,
"torch.device"
,
"transformers.trainer_utils.HubStrategy"
,
"accelerate.utils.dataclasses.DistributedType"
,
"axolotl.core.training_args.AxolotlTrainingArguments"
,
"transformers.trainer_utils.SaveStrategy"
,
"transformers.integrations.deepspeed.HfDeepSpeedConfig"
,
"transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
How to fix it?
8.82 kB
LFS
Training in progress, step 1392, checkpoint
11 days ago
vocab.json
Safe
2.78 MB
Training in progress, step 1392, checkpoint
11 days ago
zero_to_fp32.py
Safe
29.2 kB
Training in progress, step 1392, checkpoint
11 days ago