HAF-RM: A Hybrid Alignment Framework for Reward Model Training Paper • 2407.04185 • Published Jul 4, 2024
ARKS: Active Retrieval in Knowledge Soup for Code Generation Paper • 2402.12317 • Published Feb 19, 2024
ALaRM: Align Language Models via Hierarchical Rewards Modeling Paper • 2403.06754 • Published Mar 11, 2024
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation Paper • 2211.11501 • Published Nov 18, 2022