Shihan Dou

Ablustrund

AI & ML interests

Natural Language Processing, Large Language Models

Recent Activity

authored a paper about 15 hours ago
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
authored a paper about 15 hours ago
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
authored a paper about 15 hours ago
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Organizations

OpenMOSS, Fudan NLP, SII

Papers 22

arxiv:2507.05197
arxiv:2504.13914
arxiv:2502.17184
arxiv:2412.12505

Models 1

Ablustrund/moss-rlhf-reward-model-7B-zh

Updated Jul 13, 2023 • 3 • 23

Datasets 0

None public yet