philschmid
/

falcon-40b-instruct-GPTQ-inference-endpoints

Text Generation

Model card Files Files and versions Community

falcon-40b-instruct-GPTQ-inference-endpoints

Ctrl+K

Ctrl+K

2 contributors

History: 6 commits

philschmid's picture

Update handler.py

abdc7a2 about 2 years ago

.gitattributes

1.48 kB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
README.md

14.2 kB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
config.json

721 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
configuration_RW.py

2.51 kB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
generation_config.json

111 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
gptq_model-4bit--1g.safetensors

22.5 GB
LFS

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
handler.py

1.5 kB

Update handler.py about 2 years ago
modelling_RW.py

47.1 kB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
quantize_config.json

183 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
requirements.txt

92 Bytes

Update requirements.txt about 2 years ago
special_tokens_map.json

281 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
tokenizer.json

2.73 MB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago
tokenizer_config.json

220 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 2 years ago