VeriCoder: Enhancing LLM-Based RTL Code Generation through Functional Correctness Validation Paper • 2504.15659 • Published Apr 22
Improving Assembly Code Performance with Large Language Models via Reinforcement Learning Paper • 2505.11480 • Published 26 days ago • 8
SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas Paper • 2505.14615 • Published 22 days ago • 1
CoRNStack: High-Quality Contrastive Data for Better Code Ranking Paper • 2412.01007 • Published Dec 1, 2024
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis Paper • 2503.23145 • Published Mar 29 • 34