Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alex Havrilla's picture
5 1 8

Alex Havrilla

Dahoas
niloysaha's profile picture LighterDarkness's profile picture admariner's profile picture
·
https://dahoas.github.io/
  • dahoas

AI & ML interests

NLP, RL

Organizations

CarperAI's profile picture DuckAI's profile picture Critiquers's profile picture An optimal synthetic data sampling strategy for MATH's profile picture

authored a paper 7 months ago

Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

Paper • 2412.02980 • Published Dec 4, 2024 • 15
authored 2 papers over 1 year ago

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7, 2024 • 51

GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements

Paper • 2402.10963 • Published Feb 13, 2024 • 12
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs