zhuww's Collections
Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI
Paper
•
2505.19443
•
Published
•
15
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs
Paper
•
2506.19290
•
Published
•
52
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
Paper
•
2105.12655
•
Published
StarCoder 2 and The Stack v2: The Next Generation
Paper
•
2402.19173
•
Published
•
148
SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs
Paper
•
2504.14757
•
Published
OctoPack: Instruction Tuning Code Large Language Models
Paper
•
2308.07124
•
Published
•
30
rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset
Paper
•
2505.21297
•
Published
•
30
Developer-LLM Conversations: An Empirical Study of Interactions and Generated Code Quality
Paper
•
2509.10402
•
Published
•
4
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Paper
•
2306.08568
•
Published
•
28
Magicoder: Source Code Is All You Need
Paper
•
2312.02120
•
Published
•
81
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Paper
•
2405.04324
•
Published
•
25
Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs
Paper
•
2308.09895
•
Published
•
1
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Paper
•
2411.04905
•
Published
•
127
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper
•
2402.14658
•
Published
•
83
Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models
Paper
•
2506.11116
•
Published
•
4
Thinking LLMs: General Instruction Following with Thought Generation
Paper
•
2410.10630
•
Published
•
21
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding
Paper
•
2503.02951
•
Published
•
33
SWE-QA: Can Language Models Answer Repository-level Code Questions?
Paper
•
2509.14635
•
Published
•
36
CodeDPO: Aligning Code Models with Self Generated and Verified Source Code
Paper
•
2410.05605
•
Published
•
1
CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance
Paper
•
2502.04350
•
Published
•
11
Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
Paper
•
2407.21077
•
Published
•
2
OpenCodeReasoning: Advancing Data Distillation for Competitive Coding
Paper
•
2504.01943
•
Published
•
15
Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning
Paper
•
2508.03501
•
Published
•
56
Dream-Coder 7B: An Open Diffusion Language Model for Code
Paper
•
2509.01142
•
Published
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning
Paper
•
2509.19894
•
Published
•
31
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Paper
•
2502.07316
•
Published
•
50
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper
•
2406.08464
•
Published
•
71
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
Paper
•
2510.08697
•
Published
•
28
Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning
Paper
•
2509.22824
•
Published
•
20