QwQZh/gated_attention
Branch: main
Path: gated_attention/1B_gate_elementwise
1 contributor. History: 1 commit.
Latest commit: QwQZh, "Add model" (aad415c), about 1 month ago
File                     Size        Scan          Commit      Updated
config.json              985 Bytes   -             Add model   about 1 month ago
configuration_qwen3.py   11.7 kB     -             Add model   about 1 month ago
generation_config.json   138 Bytes   Safe          Add model   about 1 month ago
modeling_qwen3.py        73 kB       -             Add model   about 1 month ago
pytorch_model.bin        3.46 GB     pickle (LFS)  Add model   about 1 month ago
tokenizer.json           7.03 MB     Safe          Add model   about 1 month ago
tokenizer_config.json    7.23 kB     Safe          Add model   about 1 month ago
vocab.json               2.78 MB     Safe          Add model   about 1 month ago

Detected pickle imports (3) in pytorch_model.bin:
"collections.OrderedDict", "torch.BFloat16Storage", "torch._utils._rebuild_tensor_v2"
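
The pickle imports reported for pytorch_model.bin are the module-level names the pickle stream asks the loader to import. The sketch below shows one way such a list could be produced with Python's standard pickletools module. It is an illustration only, not the Hub's actual scanner: the helper name list_pickle_imports is hypothetical, it handles only the GLOBAL opcode (pickle protocols up to 3), and it expects a raw pickle stream. A checkpoint saved in PyTorch's zip format would first need its embedded pickle extracted.

```python
import collections
import io
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list[str]:
    """Return sorted 'module.name' strings referenced by GLOBAL opcodes.

    Hypothetical helper: it only inspects opcodes and never unpickles,
    so no code from the stream is executed.
    """
    found = set()
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        # For GLOBAL, pickletools reports the argument as "module name".
        if opcode.name == "GLOBAL":
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
    return sorted(found)


# Small stand-in payload (assumption: protocol 2, which emits GLOBAL opcodes).
payload = pickle.dumps(
    collections.OrderedDict(a=1), protocol=2, fix_imports=False
)
print(list_pickle_imports(payload))
```

A list containing only names such as collections.OrderedDict and the torch tensor-rebuild helpers is what a plain state dict would be expected to reference; anything outside that set would be worth inspecting before loading the file.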