Bigger Body 70b

A roleplay-focused ~~pseudo full-finetune~~ qlora finetune of Llama 3.3 70b. The successor to the Ink series.

Dataset

The Bigger Body (referred to as Ink v2.1, because that's still the internal name) mix is absolutely disgusting. It's even more cursed than the original Ink mix.

(Public) Original Datasets

Fizzarolli/limarp-processed
Norquinal/OpenCAI - two_users split
allura-org/Celeste1.x-data-mixture
mapsila/PIPPA-ShareGPT-formatted-named
allenai/tulu-3-sft-personas-instruction-following
readmehay/medical-01-reasoning-SFT-json
LooksJuicy/ruozhiba
shibing624/roleplay-zh-sharegpt-gpt4-data
CausalLM/Retrieval-SFT-Chat
ToastyPigeon/fujin-filtered-instruct

Quants

bartowski's imatrix ggufs
readyart's exl2 quants

Recommended Settings

Chat template: Llama 3 Instruct
Recommended samplers (not the be-all-end-all, try some on your own!):

I have literally no idea. you're on your own.

Hyperparams

General

Epochs = 2
LR = 1e-5
LR Scheduler = REX
Optimizer = CAME
Effective batch size = 16
Weight Decay = 0.01
Warmup steps = 0
Total steps = 920
Quantization = 4bit

LoRA

LoRA rank = 16
LoRA alpha = 32
LoRA dropout = 0.25

Credits

Humongous thanks to the people who created the data.
Big thanks to all Allura members for testing and emotional support ilya /platonic