Llama-3.2-1B-Instruct-ONNX / cuda /cuda-int4-rtn-block-32

Commit History

Upload config.json
e983c74
verified

parinitarahi commited on