Pre-Trained Policy Discriminators are General Reward Models Paper • 2507.05197 • Published 6 days ago • 33
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory Paper • 2507.01945 • Published 11 days ago • 72
Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs Paper • 2505.11277 • Published May 16 • 8
From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding Paper • 2506.03968 • Published Jun 4 • 16
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published 30 days ago • 63