Li Dong
unilm
AI & ML interests
Language Model Pre-Training
Recent Activity
upvoted
a
paper
9 days ago
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
commented on
a paper
10 days ago
On the Generalization of SFT: A Reinforcement Learning Perspective with
Reward Rectification