wx13
wx13
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Self-rewarding correction for mathematical reasoning
liked
a dataset
12 months ago
RLHFlow/prompt-collection-v0.1
upvoted
a
collection
12 months ago
Online RLHF
Organizations
None yet
models
0
None public yet
datasets
0
None public yet