Configurable Preference Tuning with Rubric-Guided Synthetic Data • Paper • 2506.11702 • Published Jun 13
MetaSC: Test-Time Safety Specification Optimization for Language Models • Paper • 2502.07985 • Published Feb 11