AIFGEN
Synthetic Preference Datasets for Continual Reinforcement Learning from Human Feedback - https://github.com/ComplexData-MILA/AIF-Gen

LifelongAlignment/aifgen-long-piecewise (updated May 16, 2025)
LifelongAlignment/aifgen-domain-preference-shift (updated May 16, 2025)
LifelongAlignment/aifgen-piecewise-preference-shift (updated May 16, 2025)
LifelongAlignment/aifgen-lipschitz (updated May 16, 2025)