appy1234
/

Llama-3.2-3B-Instruct-Int8DynamicActivationInt8WeightQuantized

Text Generation

feature-extraction

torchao-my-repo

text-generation-inference

Model card Files Files and versions Community

Llama-3.2-3B-Instruct-Int8DynamicActivationInt8WeightQuantized

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

appy1234's picture

Upload folder using huggingface_hub

c50f766 verified about 1 month ago

.gitattributes

1.57 kB

Upload folder using huggingface_hub about 1 month ago
README.md

42.3 kB

Upload folder using huggingface_hub about 1 month ago
config.json

1.47 kB

Upload folder using huggingface_hub about 1 month ago
pytorch_model.bin
Detected Pickle imports (16)
- "torchao.dtypes.utils.PlainLayout",
- "torch.CharStorage",
- "torch.device",
- "torch.bfloat16",
- "torchao.quantization.linear_activation_quantized_tensor.LinearActivationQuantizedTensor",
- "collections.OrderedDict",
- "torch._utils._rebuild_wrapper_subclass",
- "torch.serialization._get_layout",
- "torchao.quantization.quant_api._int8_symm_per_token_reduced_range_quant",
- "torch.BFloat16Storage",
- "torchao.dtypes.uintx.plain_layout.PlainAQTTensorImpl",
- "torch.int8",
- "torchao.dtypes.affine_quantized_tensor.AffineQuantizedTensor",
- "torchao.quantization.quant_primitives.ZeroPointDomain",
- "torch._utils._rebuild_tensor_v2",
- "torch._tensor._rebuild_from_type_v2"
How to fix it?
3.61 GB
LFS

Upload folder using huggingface_hub about 1 month ago
special_tokens_map.json

296 Bytes

Upload folder using huggingface_hub about 1 month ago
tokenizer.json

17.2 MB
LFS

Upload folder using huggingface_hub about 1 month ago
tokenizer_config.json

54.6 kB

Upload folder using huggingface_hub about 1 month ago