Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

philschmid
/
falcon-40b-instruct-GPTQ-inference-endpoints

Text Generation
Transformers
English
RefinedWeb
custom_code
Model card Files Files and versions Community
falcon-40b-instruct-GPTQ-inference-endpoints
Ctrl+K
Ctrl+K
  • 2 contributors
History: 6 commits
philschmid's picture
philschmid
Update handler.py
abdc7a2 almost 2 years ago
  • .gitattributes
    1.48 kB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • README.md
    14.2 kB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • config.json
    721 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • configuration_RW.py
    2.51 kB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • generation_config.json
    111 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • gptq_model-4bit--1g.safetensors
    22.5 GB
    LFS
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • handler.py
    1.5 kB
    Update handler.py almost 2 years ago
  • modelling_RW.py
    47.1 kB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • quantize_config.json
    183 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • requirements.txt
    92 Bytes
    Update requirements.txt almost 2 years ago
  • special_tokens_map.json
    281 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • tokenizer.json
    2.73 MB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago
  • tokenizer_config.json
    220 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ almost 2 years ago