WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning Paper • 2509.04744 • Published 4 days ago • 9
ReCode: Updating Code API Knowledge with Reinforcement Learning Paper • 2506.20495 • Published Jun 25 • 8
A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures Paper • 2506.19676 • Published Jun 24