Benchmarking Language Model Creativity: A Case Study on Code Generation • Paper 2407.09007 • Published Jul 12, 2024
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning • Paper 2410.01044 • Published Oct 1, 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning • Paper 2409.12183 • Published Sep 18, 2024