Rethinking Verification for LLM Code Generation: From Generation to Testing Paper • 2507.06920 • Published about 16 hours ago • 12
CompassVerifier Collection CompassVerifier: A Unified and Robust Verifier for Large Language Models • 3 items • Updated about 18 hours ago • 1
CompassVerifier Collection CompassVerifier: A Unified and Robust Verifier for Large Language Models • 3 items • Updated about 18 hours ago • 1
Coding Triangle: How Does Large Language Model Understand Code? Paper • 2507.06138 • Published 1 day ago • 17
Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective Paper • 2505.19815 • Published May 26 • 37
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM Paper • 2503.14478 • Published Mar 18 • 48
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference Paper • 2502.18411 • Published Feb 25 • 73