Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

5456es
/
implicit_reward_Llama-3.2-1B-Instruct_prune_0.5-sigmoid

Safetensors
llama
dpo
preference-learning
implicit
pruned
Model card Files Files and versions
xet
Community
implicit_reward_Llama-3.2-1B-Instruct_prune_0.5-sigmoid
1.52 kB
  • 1 contributor
History: 1 commit
5456es's picture
5456es
initial commit
434333c verified 2 months ago
  • .gitattributes
    1.52 kB
    initial commit 2 months ago