SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Paper • 2502.09604 • Published about 19 hours ago • 19
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 5 items • Updated 8 days ago • 33
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 11 items • Updated 3 days ago • 61
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback Paper • 2501.10799 • Published 27 days ago • 15
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework Paper • 2411.06176 • Published Nov 9, 2024 • 45