On Path to Multimodal Generalist: General-Level and General-Bench • Paper • 2505.04620 • Published 8 days ago • 72
Multi-Agent System for Comprehensive Soccer Understanding • Paper • 2505.03735 • Published 9 days ago • 20
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning • Paper • 2505.01441 • Published 17 days ago • 35
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers • Paper • 2504.20752 • Published 16 days ago • 86
Spatial Speech Translation: Translating Across Space With Binaural Hearables • Paper • 2504.18715 • Published 19 days ago • 7
DeepCritic: Deliberate Critique with Large Language Models • Paper • 2505.00662 • Published 14 days ago • 49
Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report • Paper • 2504.21039 • Published 17 days ago • 15
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think • Paper • 2504.20708 • Published 16 days ago • 22
CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges • Paper • 2504.19093 • Published 18 days ago • 16
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning • Paper • 2504.17192 • Published 21 days ago • 108
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment • Paper • 2504.15585 • Published 23 days ago • 13
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities • Paper • 2504.16078 • Published 23 days ago • 20
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation • Paper • 2504.14538 • Published 25 days ago • 28