Tianjian Li's picture

1 7 8

Tianjian Li

dogtooth

·

https://tianjianl.github.io

truthbutcher

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

authored a paper about 2 months ago

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

upvoted a paper about 2 months ago

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

View all activity

Organizations

dogtooth 's models

None public yet