Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nbeerbower
/
Dumpling-Qwen2.5-32B-v2

Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
Model card Files Files and versions Community
1
  • Dumpling-Qwen2.5-32B-v2
    • Method

    image/png

    Dumpling-Qwen2.5-32B-v2

    nbeerbower/Rombos-EVAGutenberg-TIES-Qwen2.5-32B finetuned on:

    • nbeerbower/GreatFirewall-DPO
    • nbeerbower/Schule-DPO
    • nbeerbower/Purpura-DPO
    • nbeerbower/Arkhaios-DPO
    • jondurbin/truthy-dpo-v0.1
    • antiven0m/physical-reasoning-dpo
    • flammenai/Date-DPO-NoAsterisks
    • flammenai/Prude-Phi3-DPO
    • Atsunori/HelpSteer2-DPO
    • jondurbin/gutenberg-dpo-v0.1
    • nbeerbower/gutenberg2-dpo
    • nbeerbower/gutenberg-moderne-dpo.

    Method

    QLoRA ORPO tuned with 8x A100 for 2 epochs. Rank 64 LoRA, 2e-5 learning rate.

    Downloads last month
    41
    Safetensors
    Model size
    32.8B params
    Tensor type
    BF16
    ·
    Inference Providers NEW
    Text Generation
    This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

    Model tree for nbeerbower/Dumpling-Qwen2.5-32B-v2

    Base model

    nbeerbower/Rombos-EVAGutenberg-TIES-Qwen2.5-32B
    Finetuned
    (2)
    this model
    Merges
    6 models
    Quantizations
    13 models

    Datasets used to train nbeerbower/Dumpling-Qwen2.5-32B-v2

    jondurbin/gutenberg-dpo-v0.1

    Viewer • Updated Jan 12, 2024 • 918 • 907 • 142

    jondurbin/truthy-dpo-v0.1

    Viewer • Updated Jan 11, 2024 • 1.02k • 274 • 134

    Atsunori/HelpSteer2-DPO

    Viewer • Updated Jul 11, 2024 • 7.59k • 90 • 8

    Collection including nbeerbower/Dumpling-Qwen2.5-32B-v2

    Dumplings

    Collection
    Qwen2.5 finetunes aiming for decensorship and improved english prose • 9 items • Updated Feb 23 • 2
    Company
    TOS Privacy About Jobs
    Website
    Models Datasets Spaces Pricing Docs