Yuxiao Qu PRO

CohenQu

AI & ML interests

None yet

Recent Activity

updated a dataset 3 minutes ago
CohenQu/finemath-4plus-flexible-ordering.00.09
updated a dataset 13 minutes ago
CohenQu/Joint_train_final_stage_RL
published a dataset 13 minutes ago
CohenQu/Joint_train_final_stage_RL

Organizations

HF CMU Collab, Information Seeking, Active Reasoning, CMU Artificial Intelligence and Reinforcement Learning (AIRe) Lab, CMU DeepScaleR Colab, OlympicCoder

authored 3 papers 4 months ago

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

Paper • 2407.18219 • Published Jul 25, 2024 • 3

Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning

Paper • 2310.18247 • Published Oct 27, 2023

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 46
authored a paper 8 months ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 32