Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions

Jan-21-2025, 18:42:58 GMT–Communications of the ACM

There is burgeoning interest in designing AI-based systems to assist humans in designing computing systems, including tools that automatically generate computer code. The most notable of these comes in the form of the first self-described "AI pair programmer," GitHub Copilot, a language model trained over open source GitHub code. However, code often contains bugs--and so, given the vast quantity of unvetted code that Copilot has processed, it is certain that the language model will have learned from exploitable, buggy code. This raises concerns on the security of Copilot's code contributions. In this work, we systematically investigate the prevalence and conditions that can cause GitHub Copilot to recommend insecure code.

copilot, large language model, natural language, (17 more...)

Communications of the ACM

Jan-21-2025, 18:42:58 GMT

Journals Web Page

Add feedback

Industry:
- Information Technology > Security & Privacy (0.32)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)