THU-KEG/WildReward-8B

Wesleythu committed 7 days ago

Commit 31539eb · verified · 1 parent: a2301d9

Update README.md

Files changed (1)

README.md +9 -0

@@ -109,6 +109,15 @@ WildReward achieves competitive results on standard reward model benchmarks whil
 ## Citation
 ```bibtex
+@misc{peng2026wildrewardlearningrewardmodels,
+      title={WildReward: Learning Reward Models from In-the-Wild Human Interactions},
+      author={Hao Peng and Yunjia Qi and Xiaozhi Wang and Zijun Yao and Lei Hou and Juanzi Li},
+      year={2026},
+      eprint={2602.08829},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2602.08829},
+}
 ```
 ## License