Training procedure
This model is a fine-tuned version of Falcon-180B, trained with PEFT LoRA, DeepSpeed ZeRO-3, Flash Attention, and activation checkpointing. See the blog post "Falcon 180B Finetuning using 🤗 PEFT and DeepSpeed" for more information.
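To give a rough idea of the DeepSpeed ZeRO stage 3 part of this setup, here is a minimal sketch that builds a config dictionary and prints it as JSON. The exact settings used for this model are not listed in this card, so every value below is an assumption chosen for illustration. The helper name `build_zero3_config` and its `offload_to_cpu` flag are hypothetical. LoRA, Flash Attention, and activation checkpointing are configured elsewhere in the training script and are not covered here.

```python
import json


def build_zero3_config(offload_to_cpu: bool = False) -> dict:
    """Build an illustrative DeepSpeed config enabling ZeRO stage 3 with bf16.

    Hypothetical helper: the values are assumptions, not the settings
    actually used to train this model.
    """
    zero = {
        "stage": 3,  # shard parameters, gradients, and optimizer states
        "overlap_comm": True,
        "contiguous_gradients": True,
        # Gather the sharded 16-bit weights when saving a checkpoint.
        "stage3_gather_16bit_weights_on_model_save": True,
    }
    if offload_to_cpu:
        # Optional: move optimizer states and parameters to CPU memory.
        zero["offload_optimizer"] = {"device": "cpu", "pin_memory": True}
        zero["offload_param"] = {"device": "cpu", "pin_memory": True}

    return {
        "bf16": {"enabled": True},
        "zero_optimization": zero,
        # "auto" is resolved by the Hugging Face Trainer/Accelerate integration;
        # replace with integers when launching DeepSpeed directly.
        "gradient_accumulation_steps": "auto",
        "train_micro_batch_size_per_gpu": "auto",
    }


if __name__ == "__main__":
    print(json.dumps(build_zero3_config(), indent=2))
```

The printed JSON can be saved to a file and passed to a launcher that accepts a DeepSpeed config. Whether CPU offload is needed depends on the available GPU memory.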
Framework versions
- PEFT 0.6.0.dev0