SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence Paper • 2512.22334 • Published Dec 26, 2025 • 35
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs Paper • 2601.01836 • Published 29 days ago • 10
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents Paper • 2512.23343 • Published Dec 29, 2025 • 28
Diversity or Precision? A Deep Dive into Next Token Prediction Paper • 2512.22955 • Published Dec 28, 2025 • 8