ORLM: Training Large Language Models for Optimization Modeling Paper • 2405.17743 • Published May 28, 2024 • 3
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published Jan 24, 2025 • 34
Enabling Scalable Oversight via Self-Evolving Critic Paper • 2501.05727 • Published Jan 10, 2025 • 76
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models Paper • 2410.07985 • Published Oct 10, 2024 • 34
DPTDR: Deep Prompt Tuning for Dense Passage Retrieval Paper • 2208.11503 • Published Aug 24, 2022
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models Paper • 2402.13064 • Published Feb 20, 2024 • 51
MathScale: Scaling Instruction Tuning for Mathematical Reasoning Paper • 2403.02884 • Published Mar 5, 2024 • 17