CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning

Jan-16-2025, 22:59:20 GMT–Neural Information Processing Systems

Program synthesis or code generation aims to generate a program that satisfies a problem specification. Recent approaches using large-scale pretrained language models (LMs) have shown promising results, yet they have some critical limitations. In particular, they often follow a standard supervised fine-tuning procedure to train a code generation model from natural language problem descriptions and ground-truth programs only. Such paradigm largely ignores some important but potentially useful signals in the problem specification such as unit tests, which thus results in poor performance when solving complex unseen coding tasks. We propose "CodeRL" to address the limitations, a new framework for program synthesis tasks through pretrained LMs and deep reinforcement learning (RL).

artificial intelligence, machine learning, natural language, (10 more...)

Neural Information Processing Systems

Jan-16-2025, 22:59:20 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Automatic Programming (0.88)