metadata
library_name: transformers
license: apache-2.0
datasets:
  - nbeerbower/GreatFirewall-DPO
  - nbeerbower/Schule-DPO
  - nbeerbower/Purpura-DPO
  - nbeerbower/Arkhaios-DPO
  - jondurbin/truthy-dpo-v0.1
  - antiven0m/physical-reasoning-dpo
  - flammenai/Date-DPO-NoAsterisks
  - flammenai/Prude-Phi3-DPO
  - Atsunori/HelpSteer2-DPO
  - jondurbin/gutenberg-dpo-v0.1
  - nbeerbower/gutenberg2-dpo
  - nbeerbower/gutenberg-moderne-dpo
base_model:
  - nbeerbower/Rombos-EVAGutenberg-TIES-Qwen2.5-32B

Dumpling-Qwen2.5-32B-v2

nbeerbower/Rombos-EVAGutenberg-TIES-Qwen2.5-32B finetuned on the datasets listed under datasets in the metadata above.

Method

Tuned with ORPO using QLoRA on 8x A100 GPUs for 2 epochs, with a rank-64 LoRA and a learning rate of 2e-5.
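
For readers unfamiliar with ORPO, the following is a minimal sketch of the objective as it is usually described: the ordinary supervised loss on the chosen response plus a weighted odds-ratio penalty that favors the chosen response over the rejected one. This is not the training code used for this model. The function names, the example inputs, and the weight `lam` are hypothetical and chosen only for illustration; the card does not state the weight that was used.

```python
import math


def log_odds(avg_logprob: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logprob).

    avg_logprob is assumed to be the length-normalized log-likelihood
    of a response under the model, so it must be strictly negative.
    """
    # log(1 - p) computed as log1p(-p) for numerical stability
    return avg_logprob - math.log1p(-math.exp(avg_logprob))


def orpo_loss(nll_chosen: float, logp_chosen: float,
              logp_rejected: float, lam: float = 0.1) -> float:
    """Sketch of an ORPO-style loss for one preference pair.

    nll_chosen    : supervised (negative log-likelihood) loss on the chosen response
    logp_chosen   : average log-probability of the chosen response
    logp_rejected : average log-probability of the rejected response
    lam           : hypothetical weight of the odds-ratio term (an assumption)
    """
    log_ratio = log_odds(logp_chosen) - log_odds(logp_rejected)
    # -log(sigmoid(x)) == log(1 + exp(-x))
    odds_ratio_term = math.log1p(math.exp(-log_ratio))
    return nll_chosen + lam * odds_ratio_term


if __name__ == "__main__":
    # The penalty shrinks as the chosen response becomes more likely
    # than the rejected one.
    print(orpo_loss(1.0, logp_chosen=-0.2, logp_rejected=-1.0))
    print(orpo_loss(1.0, logp_chosen=-1.0, logp_rejected=-0.2))
```

Because the penalty is added to the supervised loss, a single training pass of this kind needs no separate reference model, which is one reason it pairs naturally with a memory-constrained QLoRA setup.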