Dongfu jiang
jiangdongfu
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget
Allocation
upvoted
a
paper
5 days ago
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image
Editing
upvoted
a
paper
11 days ago
VCRL: Variance-based Curriculum Reinforcement Learning for Large
Language Models