Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lapp0
/
distily_experiments_loss_kl
like
0
TensorBoard
Safetensors
distily
qwen2
Generated from Trainer
8-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
main
distily_experiments_loss_kl
Ctrl+K
Ctrl+K
1 contributor
History:
25 commits
This model has 1 file scanned as unsafe.
Show
files
lapp0
End of training
6d41d04
verified
11 months ago
runs
End of training
11 months ago
.gitattributes
1.52 kB
initial commit
11 months ago
README.md
3.22 kB
End of training
11 months ago
added_tokens.json
80 Bytes
Training in progress, step 500
11 months ago
config.json
1.19 kB
Training in progress, step 500
11 months ago
generation_config.json
121 Bytes
End of training
11 months ago
merges.txt
1.67 MB
Training in progress, step 500
11 months ago
model.safetensors
988 MB
LFS
Training in progress, step 6187
11 months ago
special_tokens_map.json
367 Bytes
Training in progress, step 500
11 months ago
tokenizer.json
7.03 MB
Training in progress, step 500
11 months ago
tokenizer_config.json
1.3 kB
Training in progress, step 500
11 months ago
training_args.bin
Unsafe
pickle
Detected Pickle imports (17)
"_codecs.encode"
,
"transformers.trainer_utils.HubStrategy"
,
"accelerate.utils.dataclasses.DistributedType"
,
"distily.args.DistillationTrainingArguments"
,
"torch.device"
,
"accelerate.state.PartialState"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.LongStorage"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"distily.metrics.PerplexityEvalCallback"
,
"tokenizers.Encoding"
,
"__builtin__.getattr"
,
"transformers.tokenization_utils_base.BatchEncoding"
,
"collections.OrderedDict"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.training_args.OptimizerNames"
,
"transformers.trainer_utils.SchedulerType"
How to fix it?
630 MB
LFS
Training in progress, step 500
11 months ago
vocab.json
2.78 MB
Training in progress, step 500
11 months ago