KodCode-V1 Collection KodCode-V1 is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. • 4 items • Updated 3 days ago • 2
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding Paper • 2503.02951 • Published 6 days ago • 25
DenseRewardRLHF-PPO Collection This repository contains the released models for our paper Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model. • 18 items • Updated Jan 11 • 1
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model Paper • 2501.02790 • Published Jan 6 • 9
Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers Paper • 2310.05400 • Published Oct 9, 2023 • 1
TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing Paper • 2203.17266 • Published Mar 31, 2022
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts Paper • 2402.10958 • Published Feb 12, 2024