ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention Paper • 2507.01004 • Published 9 days ago • 10
Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning Paper • 2505.22203 • Published May 28 • 6
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published May 26 • 67
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published May 26 • 67
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published May 26 • 67
hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier Reinforcement Learning • 8B • Updated May 28 • 7
hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier Reinforcement Learning • 8B • Updated May 28 • 7