Falcon 40B Inference at 4bit in Google Colab
pinned
27
#38 opened over 1 year ago
by
serin32
Custom 4-bit Finetuning 5-7 times faster inference than QLora
pinned
6
#25 opened over 1 year ago
by
rmihaylov
remove-extra-parentheses
#115 opened 7 months ago
by
ZennyKenny
![](https://cdn-avatars.huggingface.co/v1/production/uploads/656e3808d4de03a07d116850/62cFw46AmuhdI3gS24F1M.jpeg)
Could not locate the configuration_RW.py inside tiiuae/falcon-40b-instruct.
#114 opened 10 months ago
by
cosmino
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/Pk0DpkW2qigMUhfA-1TkU.jpeg)
[AUTOMATED] Model Memory Requirements
#113 opened 10 months ago
by
model-sizer-bot
Adding Evaluation Results
#111 opened 11 months ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
Could someone upload a tokenizer.model file? to allow for making ggufs
#110 opened about 1 year ago
by
RonanMcGovern
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/-6Yq7oM_Ju6Zi2GEvobvb.jpeg)
Add chat_template so that it can be used for chat out-of-box
#109 opened over 1 year ago
by
chujiezheng
![](https://cdn-avatars.huggingface.co/v1/production/uploads/610b70452719facd4ea85e28/S7nMy7D0Rxq0VIVblhYDG.jpeg)
pb when testing the model
#108 opened over 1 year ago
by
louvivien
Update generation_config.json
1
#106 opened over 1 year ago
by
nkasmanoff
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60d3850107da9c17c7270912/WzhEbEvjunrDJ2IpdOxtZ.png)
Gradio interface
#105 opened over 1 year ago
by
sequentialsystems
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64e71ab2e9fc9d0475ed6143/KPkW28VGm5pYhQx9OETax.jpeg)
Optimizing Inference Time for Chat Conversations on Falcon
2
#104 opened over 1 year ago
by
humza-sami
![](https://cdn-avatars.huggingface.co/v1/production/uploads/633d6d4f48ab6a0add2ce1a3/qTO75kR0hk1Yn1SaP7ZPb.jpeg)
Finetuned Falcon40 is not working with pipeline (text-generation)
#103 opened over 1 year ago
by
chelouche9
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64749163e0b188d3cb2139d9/1gG08kGxUA-39EE8njT2W.png)
Advice on inference over a large-ish dataset in Databricks?
#102 opened over 1 year ago
by
archonlith
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/adLUysfQCBOaoYk8LVGBD.jpeg)
Use input attention mask instead of casual mask in attention
#101 opened over 1 year ago
by
CyberZHG
Inference
4
#99 opened over 1 year ago
by
davidhung
Best Practice for Handling Variable-Length Sequences in Training an LLM Model on a Chatbot Dataset
#98 opened over 1 year ago
by
humza-sami
![](https://cdn-avatars.huggingface.co/v1/production/uploads/633d6d4f48ab6a0add2ce1a3/qTO75kR0hk1Yn1SaP7ZPb.jpeg)
Request: DOI
#97 opened over 1 year ago
by
waelTalan
Getting HTTP Error Code: 422 when using Inference API
2
#96 opened over 1 year ago
by
reetkat
Run falcon on Mac
2
#95 opened over 1 year ago
by
corin9122
Unable to use all cores.
2
#94 opened over 1 year ago
by
armx40
![](https://cdn-avatars.huggingface.co/v1/production/uploads/645946a9e2b310359728e364/tLbC4kRxJyqBf02A33tT0.jpeg)
Bug: the model's head dimensionality is hardcoded
#93 opened over 1 year ago
by
deleted
Fine-tune on model response only?
1
#92 opened over 1 year ago
by
mkserge
Finetuning Base Falcon on Unseen Language/New data (non instruct/RLHF)
2
#91 opened over 1 year ago
by
AshBam
Slow response time for 7b and 40b
6
#89 opened over 1 year ago
by
kartik99
configuration_RW.py Missing in the latest commit
#88 opened over 1 year ago
by
ravikiran3690
Update README.md
2
#87 opened over 1 year ago
by
FelixMildon
Falcon breaks after the second prompt of code.
#86 opened over 1 year ago
by
thecowmilk
Changes in modelling_RW.py to be able to handle past_key_values for faster model generations
8
#85 opened over 1 year ago
by
puru22
@TII Falcon is stunning but will you continue or is the majestic bird destined to starve ?
#84 opened over 1 year ago
by
cmp-nct
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6344a1b0762379fc63017e62/g4VIT8l2lZIj6AoQAwVy7.png)
Finetune Error using the notebook referred on the model page
#83 opened over 1 year ago
by
hamad
Nvidia H100 Finetuning Error on BitsandBytes
2
#82 opened over 1 year ago
by
ashmitbhattarai
new here, confused which .bin file to download?
#80 opened over 1 year ago
by
kingofdelphi
Update generation_config.json
#77 opened over 1 year ago
by
psinger
![](https://cdn-avatars.huggingface.co/v1/production/uploads/636d18755aaed143cd6698ef/AalDh13Gp8jv1BfM5IASh.png)
Request: DOI
#76 opened over 1 year ago
by
winter6below618
Seeking insights on integrating RAG with Falcon for Domain Specific requirements
#75 opened over 1 year ago
by
rahul2008d
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1670590059080-noauth.jpeg)
Prevent Hallucinations
1
#74 opened over 1 year ago
by
Zhaoqiong
Deployment on Azure ML
1
#73 opened over 1 year ago
by
Eliahu551818
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MA1oN0Txu3552kgKUbb2B.jpeg)
Access To Hidden States
#72 opened over 1 year ago
by
DJT777
Were special tokens trained?
#71 opened over 1 year ago
by
Tron2060
Example code from README output is nonsense
1
#70 opened over 1 year ago
by
amitgurintecom
New language
2
#69 opened over 1 year ago
by
mindplay
GPU requirements
7
#68 opened over 1 year ago
by
GuySerk
Cuda out of memory error.
2
#67 opened over 1 year ago
by
ibrim
ValueError: The following model_kwargs are not used by the model: ['token_type_ids'] (note: typos in the generate arguments will also show up in this list)
1
#66 opened over 1 year ago
by
yiz4869
How to fine tune falcon for summarization on xsum?
1
#65 opened over 1 year ago
by
uzumakiusa
Need claritiy about the adjustable model hyperparameters
#64 opened over 1 year ago
by
Someshfengde
Update README.md
#63 opened over 1 year ago
by
Gage888
Borken docs link Use in transformers
1
#62 opened over 1 year ago
by
natika1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63f2458e7ddf724fbcc08027/k8r11ePnzjmEd_qVuxCd6.png)
Hello, may I know where can I get the embeddings for falcon-40b?
3
#61 opened over 1 year ago
by
kurtgan