Agents of Change: Self-Evolving LLM Agents for Strategic Planning Paper • 2506.04651 • Published 10 days ago • 6
Agents of Change: Self-Evolving LLM Agents for Strategic Planning Paper • 2506.04651 • Published 10 days ago • 6
THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models Paper • 2504.13367 • Published Apr 17 • 24
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9 • 9
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9 • 9
view article Article LeMaterial: an open source initiative to accelerate materials discovery and research By AlexDuvalinho and 9 others • Dec 10, 2024 • 50
MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate Paper • 2406.14711 • Published Jun 20, 2024 • 1
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models Paper • 2305.13712 • Published May 23, 2023 • 2