yicui's Collections
RL

updated Jan 15
  • LLM-Powered Code Vulnerability Repair with Reinforcement Learning and Semantic Reward

    Paper • 2401.03374 • Published Jan 7, 2024

  • Code Security Vulnerability Repair Using Reinforcement Learning with Large Language Models

    Paper • 2401.07031 • Published Jan 13, 2024

  • Coarse-Tuning Models of Code with Reinforcement Learning Feedback

    Paper • 2305.18341 • Published May 25, 2023

  • Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation

    Paper • 2310.02368 • Published Oct 3, 2023

  • TDD Without Tears: Towards Test Case Generation from Requirements through Deep Reinforcement Learning

    Paper • 2401.07576 • Published Jan 15, 2024

  • The Lessons of Developing Process Reward Models in Mathematical Reasoning

    Paper • 2501.07301 • Published Jan 13 • 98