- Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
  Paper • 2409.04109 • Published • 48
- Training Language Models to Self-Correct via Reinforcement Learning
  Paper • 2409.12917 • Published • 140
- Reward-Robust RLHF in LLMs
  Paper • 2409.15360 • Published • 6
- EuroLLM: Multilingual Language Models for Europe
  Paper • 2409.16235 • Published • 28
Haote Yang
Hoter
Recent Activity
liked a model 5 days ago
opendatalab/MinerU2.5-2509-1.2B
upvoted a paper 2 months ago
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning