Austin Xu's picture

1

Austin Xu

austinxu87

·

AI & ML interests

None yet

Organizations

upvoted a paper 4 months ago

J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization

Paper • 2505.13346 • Published May 19 • 2