ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization Paper • 2509.13313 • Published 6 days ago • 64
Inpainting-Guided Policy Optimization for Diffusion Large Language Models Paper • 2509.10396 • Published 10 days ago • 15
view article Article mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL By driaforall and 1 other • 11 days ago • 19
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers By ariG23498 and 6 others • 12 days ago • 144
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning Paper • 2509.09674 • Published 11 days ago • 73
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning Paper • 2509.08755 • Published 12 days ago • 55
Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search Paper • 2509.07969 • Published 13 days ago • 59
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published 13 days ago • 96
view article Article How to Choose the Best Open Source LLM for Your Project in 2025 By dvilasuero • 13 days ago • 70
Efficient Multi-Source Knowledge Transfer by Model Merging Paper • 2508.19353 • Published 27 days ago • 1
DivMerge: A divergence-based model merging method for multi-tasking Paper • 2509.02108 • Published 20 days ago • 24