ComfyUI-R1: Exploring Reasoning Models for Workflow Generation Paper • 2506.09790 • Published 29 days ago • 52
Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance Paper • 2506.06444 • Published Jun 6 • 73
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published 27 days ago • 63
Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research Paper • 2502.04644 • Published Feb 7 • 2
Deep Research Agents: A Systematic Examination And Roadmap Paper • 2506.18096 • Published 18 days ago • 1
Can LLMs Identify Critical Limitations within Scientific Research? A Systematic Evaluation on AI Research Papers Paper • 2507.02694 • Published 7 days ago • 18
Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky Paper • 2507.03336 • Published 6 days ago • 3