ENOT-AutoDL/gpt-j-6B-tensorrt-int8

Tags: Text Generation, Transformers, ONNX, English, text-generation-inference, causal-lm, int8, tensorrt
  • 3 contributors
  • History: 15 commits
  • Latest commit: 2b7d07d by igor, "updated metrics table", almost 2 years ago
Files:
  • .gitattributes (1.57 kB): "added onnx model (fake quant) compatible with trt", almost 2 years ago
  • NVIDIA_GeForce_RTX_2080_Ti-8_5_3_1-i8f32.engine (8.5 GB, LFS): "added 2080ti engine", about 2 years ago
  • NVIDIA_GeForce_RTX_3080_Ti-8_5_3_1-i8f32.engine (8.5 GB, LFS): "normalized engine name", about 2 years ago
  • NVIDIA_GeForce_RTX_4090-8_5_3_1-i8f32.engine (8.5 GB, LFS): "added 4090 engine", about 2 years ago
  • README.md (1.74 kB): "updated metrics table", almost 2 years ago
  • gptj-i8.data (24.3 GB, LFS): "added onnx model (fake quant) compatible with trt", almost 2 years ago
  • gptj-i8.onnx (1.61 MB, LFS): "added onnx model (fake quant) compatible with trt", almost 2 years ago
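
The three .engine files in the listing appear to share one naming pattern: the GPU name with spaces replaced by underscores, then a version string, then a precision tag. The sketch below rebuilds a file name from a GPU name under that reading. The helper engine_filename is hypothetical and not part of this repository, and treating "8_5_3_1" as a TensorRT version and "i8f32" as a precision tag is an assumption inferred from the file names, not something the listing states.

```python
def engine_filename(gpu_name: str,
                    trt_version: str = "8.5.3.1",
                    precision: str = "i8f32") -> str:
    """Build an engine file name following the pattern seen in the listing.

    Hypothetical helper: the meaning of the version and precision parts
    is an assumption inferred from the listed file names.
    """
    # "NVIDIA GeForce RTX 4090" -> "NVIDIA_GeForce_RTX_4090"
    gpu_part = "_".join(gpu_name.split())
    # "8.5.3.1" -> "8_5_3_1"
    version_part = trt_version.replace(".", "_")
    return f"{gpu_part}-{version_part}-{precision}.engine"


if __name__ == "__main__":
    for gpu in ("NVIDIA GeForce RTX 2080 Ti",
                "NVIDIA GeForce RTX 3080 Ti",
                "NVIDIA GeForce RTX 4090"):
        print(engine_filename(gpu))
```

A caller could pass the device name reported by its GPU runtime to decide which of the listed engine files to download. Whether the name that runtime reports matches these strings exactly is not confirmed by the listing.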