ENOT-AutoDL
/

gpt-j-6B-tensorrt-int8

Text Generation

text-generation-inference

Model card Files Files and versions Community

gpt-j-6B-tensorrt-int8

Ctrl+K

Ctrl+K

3 contributors

History: 15 commits

igor

updated metrics table

2b7d07d about 2 years ago

.gitattributes

1.57 kB

added onnx model (fake quant) compatible with trt about 2 years ago
NVIDIA_GeForce_RTX_2080_Ti-8_5_3_1-i8f32.engine

8.5 GB
LFS

added 2080ti engine about 2 years ago
NVIDIA_GeForce_RTX_3080_Ti-8_5_3_1-i8f32.engine

8.5 GB
LFS

normalized engine name about 2 years ago
NVIDIA_GeForce_RTX_4090-8_5_3_1-i8f32.engine

8.5 GB
LFS

added 4090 engine about 2 years ago
README.md

1.74 kB

updated metrics table about 2 years ago
gptj-i8.data

24.3 GB
LFS

added onnx model (fake quant) compatible with trt about 2 years ago
gptj-i8.onnx

1.61 MB
LFS

added onnx model (fake quant) compatible with trt about 2 years ago