Qwen3-Quantization Collection This is the official quantized models collection of Qwen3 Quantization • 42 items • Updated 5 days ago • 5
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers Paper • 2505.04842 • Published 4 days ago • 12
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning Paper • 2504.13941 • Published 26 days ago • 10
The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs Paper • 2504.17768 • Published 18 days ago • 13
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning Paper • 2504.16656 • Published 19 days ago • 55
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Paper • 2504.19413 • Published 14 days ago • 12
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning Paper • 2504.20073 • Published 18 days ago • 9
Taming the Titans: A Survey of Efficient LLM Inference Serving Paper • 2504.19720 • Published 14 days ago • 10
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think Paper • 2504.20708 • Published 13 days ago • 22
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax Paper • 2504.20966 • Published 13 days ago • 26
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 12 days ago • 39