rvs/llama3-8b-Instruct-kvc-AWQ-int4-onnx
Tags: ONNX, text-generation-inference, llama, llama3
Branch: main
Total size: 6.79 GB
1 contributor
History: 2 commits
Latest commit: 6b1fd65 (verified) by rvs, "Upload folder using huggingface_hub", 3 months ago
File                     Size       Flags  Last commit                          Updated
.gitattributes           1.66 kB    Safe   Upload folder using huggingface_hub  3 months ago
README.md                6.94 kB    -      Upload folder using huggingface_hub  3 months ago
config.json              0 Bytes    Safe   Upload folder using huggingface_hub  3 months ago
entrypoint.py            26.9 kB    -      Upload folder using huggingface_hub  3 months ago
model.onnx               5.73 GB    xet    Upload folder using huggingface_hub  3 months ago
onnx__MatMul_10363       1.05 GB    xet    Upload folder using huggingface_hub  3 months ago
special_tokens_map.json  301 Bytes  Safe   Upload folder using huggingface_hub  3 months ago
token_id_to_str.json     2.8 MB     Safe   Upload folder using huggingface_hub  3 months ago
tokenizer.json           9.09 MB    Safe   Upload folder using huggingface_hub  3 months ago
tokenizer_config.json    51 kB      Safe   Upload folder using huggingface_hub  3 months ago