Jacobsen Salt

jskos

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published 8 days ago • 144

liked a model over 1 year ago

openbmb/MiniCPM-2B-sft-fp32

Text Generation • Updated Sep 7, 2024 • 1.23k • 295

Organizations

None yet