Bigger Body 70b
A roleplay-focused pseudo full-finetune qlora finetune of Llama 3.3 70b.
The successor to the Ink series.
Dataset
The Bigger Body (referred to as Ink v2.1, because that's still the internal name) mix is absolutely disgusting. It's even more cursed than the original Ink mix.
(Public) Original Datasets
- Fizzarolli/limarp-processed
- Norquinal/OpenCAI -
two_users
split - allura-org/Celeste1.x-data-mixture
- mapsila/PIPPA-ShareGPT-formatted-named
- allenai/tulu-3-sft-personas-instruction-following
- readmehay/medical-01-reasoning-SFT-json
- LooksJuicy/ruozhiba
- shibing624/roleplay-zh-sharegpt-gpt4-data
- CausalLM/Retrieval-SFT-Chat
- ToastyPigeon/fujin-filtered-instruct
Quants
Recommended Settings
Chat template: Llama 3 Instruct
Recommended samplers (not the be-all-end-all, try some on your own!):
- I have literally no idea. you're on your own.
Hyperparams
General
- Epochs = 2
- LR = 1e-5
- LR Scheduler = REX
- Optimizer = CAME
- Effective batch size = 16
- Weight Decay = 0.01
- Warmup steps = 0
- Total steps = 920
- Quantization = 4bit
LoRA
- LoRA rank = 16
- LoRA alpha = 32
- LoRA dropout = 0.25
Credits
Humongous thanks to the people who created the data.
Big thanks to all Allura members for testing and emotional support ilya /platonic