Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
QwQZh
/
gated_attention
like
4
Model card
Files
Files and versions
Community
aad415c
gated_attention
/
1B_gate_elementwise
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
QwQZh
Add model
aad415c
3 months ago
config.json
985 Bytes
Add model
3 months ago
configuration_qwen3.py
11.7 kB
Add model
3 months ago
generation_config.json
Safe
138 Bytes
Add model
3 months ago
modeling_qwen3.py
73 kB
Add model
3 months ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
3.46 GB
LFS
Add model
3 months ago
tokenizer.json
Safe
7.03 MB
Add model
3 months ago
tokenizer_config.json
Safe
7.23 kB
Add model
3 months ago
vocab.json
Safe
2.78 MB
Add model
3 months ago