Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
cs552-mlp 's Collections
Submitted Models
M3: Quantisation
M3: SFT for MCQA
M2: DPO Aligned Models

M2: DPO Aligned Models

updated Jun 14, 2024

Collection of the QLoRA Phi-3 DPO aligned models, each having different hyper-parameters.

Upvote
-

  • cs552-mlp/phi3-dpo-h4

    Updated May 28, 2024 • 3

  • cs552-mlp/phi3-dpo-h2

    Updated May 28, 2024 • 3

  • cs552-mlp/phi3-dpo-m2

    Updated May 28, 2024 • 3

  • cs552-mlp/phi3-dpo-h1

    Updated May 28, 2024 • 4

  • cs552-mlp/phi3-dpo-m1

    Updated May 28, 2024 • 5

  • cs552-mlp/phi3-dpo-h3

    Updated May 28, 2024 • 27

  • cs552-mlp/phi3-dpo-h5

    Updated May 28, 2024 • 33
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs