Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents Paper • 2509.25302 • Published Sep 29 • 1
TENET: Leveraging Tests Beyond Validation for Code Generation Paper • 2509.24148 • Published Sep 29 • 3
Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference Paper • 2508.19559 • Published Aug 27 • 6
Symbol Preference Aware Generative Models for Recovering Variable Names from Stripped Binary Paper • 2306.02546 • Published Jun 5, 2023 • 1
RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing Paper • 2501.18160 • Published Jan 30 • 2
LLMDFA: Analyzing Dataflow in Code with Large Language Models Paper • 2402.10754 • Published Feb 16, 2024 • 1
ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants Paper • 2508.03936 • Published Aug 5 • 9
$μ$KE: Matryoshka Unstructured Knowledge Editing of Large Language Models Paper • 2504.01196 • Published Apr 1
ProSec: Fortifying Code LLMs with Proactive Security Alignment Paper • 2411.12882 • Published Nov 19, 2024 • 2
MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge Paper • 2507.21183 • Published Jul 27 • 14
SePPO: Semi-Policy Preference Optimization for Diffusion Alignment Paper • 2410.05255 • Published Oct 7, 2024 • 5