Tianjian Li's picture

Tianjian Li

dogtooth

·

https://tianjianl.github.io

tianjianl

AI & ML interests

None yet

Recent Activity

updated a dataset about 9 hours ago

dogtooth/reasoning_state_rl

published a dataset about 9 hours ago

dogtooth/reasoning_state_rl

updated a model 2 days ago

dogtooth/Qwen3-4B-Instruct-2507-SFT

View all activity

Organizations

commented a paper 7 months ago

Jointly Reinforcing Diversity and Quality in Language Model Generations

Paper • 2509.02534 • Published Sep 2, 2025 • 25 •

commented a paper 10 months ago

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

Paper • 2505.02363 • Published May 5, 2025 • 7 •