metadata
license: llama3.1
tags:
- not-for-all-audiences
Techne-RP-8b
Trained with Llama 3 prompt formatting, Alpaca works too
Assistant Example @ q5_k_m
NSFW Writing Example @ q5_k_m
Training Methodology
athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit was further trained in the order below:
SFT
- Doctor-Shotgun/no-robots-sharegpt
- grimulkan/LimaRP-augmented
- Inv/c2-logs-cleaned-deslopped
DPO
- jondurbin/truthy-dpo-v0.1
- Undi95/Weyaxi-humanish-dpo-project-noemoji
- athirdpath/DPO_Pairs-Roleplay-Llama3-NSFW