Tianjian Li

dogtooth

https://tianjianl.github.io

tianjianl

AI & ML interests

None yet

Recent Activity

updated a dataset about 24 hours ago

dogtooth/reasoning_state_rl

published a dataset about 24 hours ago

dogtooth/reasoning_state_rl

updated a model 3 days ago

dogtooth/Qwen3-4B-Instruct-2507-SFT

Organizations

authored a paper 7 months ago

Jointly Reinforcing Diversity and Quality in Language Model Generations

Paper • 2509.02534 • Published Sep 2, 2025

authored a paper 10 months ago

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

Paper • 2505.02363 • Published May 5, 2025

authored a paper over 1 year ago

Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models

Paper • 2310.00840 • Published Oct 2, 2023