Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

wangclnlp
/
robust_visual_reward_model

Safetensors
vision
DPO
RLHF
preference
feedback
reward model
preference model
Model card Files Files and versions Community
robust_visual_reward_model
Ctrl+K
Ctrl+K
  • 2 contributors
History: 3 commits
gan-yang-zuzhu
update README.md
7186f2d 9 months ago
  • figure
    upload models. 9 months ago
  • .gitattributes
    1.52 kB
    initial commit 9 months ago
  • README.md
    4.89 kB
    update README.md 9 months ago
  • convert_pytorch_bin.py
    515 Bytes
    upload models. 9 months ago
  • model-00001-of-00003.safetensors
    4.99 GB
    LFS
    upload models. 9 months ago
  • model-00002-of-00003.safetensors
    4.96 GB
    LFS
    upload models. 9 months ago
  • model-00003-of-00003.safetensors
    4.18 GB
    LFS
    upload models. 9 months ago
  • model.safetensors.index.json
    72.9 kB
    upload models. 9 months ago