pB09204048
pb09204048
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
17 days ago
Debunk the Myth of SFT Generalization
upvoted
a
paper
18 days ago
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller
LLMs
liked
a dataset
23 days ago
anisha2102/RaR-Science-20k-o3-mini
Organizations
None yet