linlei's picture

3 2

linlei

thinkaboutzero

·

AI & ML interests

natural language processing

Recent Activity

upvoted a paper 5 days ago

ASPO: Asymmetric Importance Sampling Policy Optimization

upvoted a paper 12 days ago

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

liked a model about 1 month ago

Kwai-Klear/Klear-46B-A2.5B-Instruct

View all activity

Organizations

None yet

upvoted a paper 5 days ago

ASPO: Asymmetric Importance Sampling Policy Optimization

Paper • 2510.06062 • Published 5 days ago • 13

upvoted a paper 12 days ago

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Paper • 2509.26628 • Published 12 days ago • 12

upvoted a paper 3 months ago

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR

Paper • 2507.15778 • Published Jul 21 • 20