LLM Reasoning - a kaitou951 Collection

kaitou951 's Collections

LLM Reasoning

updated 4 days ago

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

Paper • 2402.07754 • Published Feb 12, 2024
Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models

Paper • 2505.10446 • Published May 15
A Survey on Latent Reasoning

Paper • 2507.06203 • Published 13 days ago • 82
Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning

Paper • 2505.16782 • Published May 22
Boosting Latent Diffusion with Flow Matching

Paper • 2312.07360 • Published Dec 12, 2023 • 3
Play to Generalize: Learning to Reason Through Game Play

Paper • 2506.08011 • Published Jun 9 • 15
Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

Paper • 2505.13886 • Published May 20 • 6
lmgame-Bench: How Good are LLMs at Playing Games?

Paper • 2505.15146 • Published May 21 • 20
Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 178
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published 21 days ago • 44
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20 • 48
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published 20 days ago • 69
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 64
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Paper • 2502.10458 • Published Feb 12 • 37