Yury Panikov

panikov

AI & ML interests

None yet

Recent Activity

commented on a paper 14 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

commented on a paper 14 days ago

Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces

commented on a paper 14 days ago

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Organizations

None yet

commented on 3 papers 14 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published 17 days ago • 14

Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces

Paper • 2605.02801 • Published 20 days ago • 7

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Paper • 2605.02913 • Published Apr 8 • 9

commented on a paper 18 days ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published 26 days ago • 273

commented on a paper about 2 months ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published Apr 6 • 203

commented on 4 papers 2 months ago

Mixture-of-Depths Attention

Paper • 2603.15619 • Published Mar 16 • 80

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 149

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published Mar 16 • 149

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 426

commented on a paper 5 months ago

When Reasoning Meets Its Laws

Paper • 2512.17901 • Published Dec 19, 2025 • 62

commented on 2 papers 6 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 268

Deep Research: A Systematic Survey

Paper • 2512.02038 • Published Nov 24, 2025 • 73

commented on 8 papers 7 months ago

The Denario project: Deep knowledge AI agents for scientific discovery

Paper • 2510.26887 • Published Oct 30, 2025 • 8

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 80

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Paper • 2510.18855 • Published Oct 21, 2025 • 73
