Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
7
8
Tianjian Li
dogtooth
Follow
Fishtiks's profile picture
jackzhang's profile picture
2 followers
·
6 following
https://tianjianl.github.io
truthbutcher
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning
authored
a paper
about 2 months ago
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
upvoted
a
paper
about 2 months ago
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
View all activity
Organizations
Papers
2
arxiv:
2505.02363
arxiv:
2310.00840
models
0
None public yet
datasets
218
Sort: Recently updated
dogtooth/helpsteer2_binarized_filtered
Viewer
•
Updated
Apr 5
•
2.51k
•
37
dogtooth/Big-Math-RL-Verified
Viewer
•
Updated
Apr 3
•
1.52M
•
29
dogtooth/default_project_dev_test
Viewer
•
Updated
Mar 26
•
4k
•
36
dogtooth/Big-Math-Selected-500
Viewer
•
Updated
Mar 25
•
3.5k
•
8
dogtooth/Big-Math-RL-Verified-Chinese
Viewer
•
Updated
Mar 6
•
251k
•
39
dogtooth/mmlu
Viewer
•
Updated
Mar 5
•
14.2k
•
114
dogtooth/boolq
Viewer
•
Updated
Mar 5
•
3.27k
•
35
dogtooth/gpqa
Viewer
•
Updated
Mar 5
•
448
•
46
dogtooth/math_qa
Viewer
•
Updated
Mar 5
•
2.99k
•
38
dogtooth/wiqa
Viewer
•
Updated
Mar 5
•
3k
•
28
View 218 datasets