Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mradermacher
/
R-PRM-7B-DPO-GGUF
like
0
Reinforcement Learning
Transformers
GGUF
Chinese
reward-model
dpo
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
c491f4b
R-PRM-7B-DPO-GGUF
Ctrl+K
Ctrl+K
1 contributor
History:
8 commits
mradermacher
uploaded from rain
c491f4b
verified
3 months ago
.gitattributes
1.88 kB
uploaded from rain
3 months ago
R-PRM-7B-DPO.Q2_K.gguf
Safe
3.02 GB
xet
uploaded from rain
3 months ago
R-PRM-7B-DPO.Q3_K_M.gguf
Safe
3.81 GB
xet
uploaded from rain
3 months ago
R-PRM-7B-DPO.Q4_K_S.gguf
Safe
4.46 GB
xet
uploaded from rain
3 months ago
R-PRM-7B-DPO.Q6_K.gguf
Safe
6.25 GB
xet
uploaded from rain
3 months ago
R-PRM-7B-DPO.Q8_0.gguf
Safe
8.1 GB
xet
uploaded from rain
3 months ago
R-PRM-7B-DPO.f16.gguf
Safe
15.2 GB
xet
uploaded from rain
3 months ago
README.md
211 Bytes
uploaded from rain
3 months ago