BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18, 2024 • 46
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Paper • 2502.09604 • Published Feb 13 • 34
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence Paper • 2502.09927 • Published Feb 14
CodeArena: A Collective Evaluation Platform for LLM Code Generation Paper • 2503.01295 • Published 22 days ago • 8
Rethinking the Influence of Source Code on Test Case Generation Paper • 2409.09464 • Published Sep 14, 2024 • 1
CodeArena: A Collective Evaluation Platform for LLM Code Generation Paper • 2503.01295 • Published 22 days ago • 8