pB09204048
pb09204048
AI & ML interests
None yet
Recent Activity
upvoted a paper 15 days ago
Debunk the Myth of SFT Generalization
upvoted a paper 16 days ago
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs
liked a dataset 21 days ago
anisha2102/RaR-Science-20k-o3-mini
Organizations
None yet