AI Can Write Code Like Humans-Bugs and All - WIRED

AI tools that help developers write code, like GitHub's Copilot, can generate code with bugs and flaws, much as humans do. Researchers found that around 40% of the code Copilot generated for certain security-related tasks contained vulnerabilities. While helpful for reducing mundane work, these AI tools also highlight the limits of current techniques, as the systems can lack an understanding of a piece of code's context and purpose. Developers must carefully review and test code produced by such tools to identify and fix errors.

WILL KNIGHT BUSINESS 09.20.2021 07:00 AM

AI Can Write Code Like Humans—Bugs and All

New tools that help developers write software also generate similar mistakes.

ILLUSTRATION: ELENA LACEY

SOME SOFTWARE DEVELOPERS are now letting artificial intelligence help write their code. They’re finding that AI is just as flawed as humans.

In June 2021, GitHub, a subsidiary of Microsoft that provides tools for hosting and collaborating on code, released a beta
version of a program that uses AI to assist programmers. Start typing a command, a database query, or a request to an
API, and the program, called Copilot, will guess your intent and write the rest.

Alex Naka, a data scientist at a biotech firm who signed up to test Copilot, says the program can be very helpful, and it
has changed the way he works. “It lets me spend less time jumping to the browser to look up API docs or examples on
Stack Overflow,” he says. “It does feel a little like my work has shifted from being a generator of code to being a
discriminator of it.”

But Naka has found that errors can creep into his code in different ways. “There have been times where I've missed some
kind of subtle error when I accept one of its proposals,” he says. “And it can be really hard to track this down, perhaps
because it seems like it makes errors that have a different flavor than the kind I would make.”

The risks of AI generating faulty code may be surprisingly high. Researchers at NYU recently analyzed code generated by
Copilot and found that, for certain tasks where security is crucial, the code contains security flaws around 40 percent of
the time.

The figure “is a little bit higher than I would have expected,” says Brendan Dolan-Gavitt, a professor at NYU involved with
the analysis. “But the way Copilot was trained wasn’t actually to write good code—it was just to produce the kind of text
that would follow a given prompt.”

Despite such flaws, Copilot and similar AI-powered tools may herald a sea change in the way software developers write
code. There’s growing interest in using AI to help automate more mundane work. But Copilot also highlights some of the
pitfalls of today’s AI techniques.

While analyzing the code made available for a Copilot plugin, Dolan-Gavitt found that it included a list of restricted
phrases. These were apparently introduced to prevent the system from blurting out offensive messages or copying
well-known code written by someone else.

Oege de Moor, vice president of research at GitHub and one of the developers of Copilot, says security has been a
concern from the start. He says the percentage of flawed code cited by the NYU researchers is only relevant for a subset
of code where security flaws are more likely.

De Moor invented CodeQL, a tool used by the NYU researchers that automatically identifies bugs in code. He says GitHub
recommends that developers use Copilot together with CodeQL to ensure their work is safe.

The GitHub program is built on top of an AI model developed by OpenAI, a prominent AI company doing cutting-edge
work in machine learning. That model, called Codex, consists of a large artificial neural network trained to predict the
next characters in both text and computer code. The algorithm ingested billions of lines of code stored on GitHub—not all
of it perfect—in order to learn how to write code.

OpenAI has built its own AI coding tool on top of Codex that can perform some stunning coding tricks. It can turn a typed
instruction, such as “Create an array of random variables between 1 and 100 and then return the largest of them,” into
working code in several programming languages.
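
As a rough illustration of what such an instruction could translate into, here is a hand-written Python sketch. It is not actual Codex output. The function name `largest_of_random` and the default array length of 10 are assumptions, since the quoted instruction does not say how many values to generate.

```python
import random


def largest_of_random(count: int = 10, low: int = 1, high: int = 100) -> int:
    """Build a list of random integers in [low, high] and return the largest.

    The default count of 10 is an assumption; the instruction quoted in the
    article does not specify a length.
    """
    values = [random.randint(low, high) for _ in range(count)]
    return max(values)


if __name__ == "__main__":
    # Prints one integer between 1 and 100; the value changes on every run.
    print(largest_of_random())
```

Even in a case this small, the instruction leaves details open (the array length, whether the values are integers), and it falls to the developer reviewing the suggestion to notice how those gaps were filled.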

A related OpenAI model, called GPT-3, can generate coherent text on a given subject, but it can also
regurgitate offensive or biased language learned from the darker corners of the web.

Copilot and Codex have led some developers to wonder if AI might automate them out of work. In fact, as Naka’s
experience shows, developers need considerable skill to use the program, as they often must vet or tweak its suggestions.

Hammond Pearce, a postdoctoral researcher at NYU involved with the analysis of Copilot code, says the program
sometimes produces problematic code because it doesn’t fully understand what a piece of code is trying to do.
“Vulnerabilities are often caused by a lack of context that a developer needs to know,” he says.
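
To make the idea of a context-dependent flaw concrete, here is a minimal Python sketch of one widely known vulnerability class, SQL injection. It is an illustration only: it is not taken from the NYU analysis or from Copilot output, and the table and function names (`users`, `find_user_unsafe`, `find_user_safe`, `make_demo_db`) are hypothetical.

```python
import sqlite3


def make_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a small, hypothetical users table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany("INSERT INTO users (name) VALUES (?)", [("alice",), ("bob",)])
    return conn


def find_user_unsafe(conn: sqlite3.Connection, name: str) -> list:
    # Vulnerable: the input is pasted directly into the SQL text, so a
    # crafted value can change the meaning of the query.
    query = f"SELECT id, name FROM users WHERE name = '{name}'"
    return conn.execute(query).fetchall()


def find_user_safe(conn: sqlite3.Connection, name: str) -> list:
    # Safer: a parameterized query treats the input strictly as data.
    query = "SELECT id, name FROM users WHERE name = ?"
    return conn.execute(query, (name,)).fetchall()


if __name__ == "__main__":
    conn = make_demo_db()
    malicious = "x' OR '1'='1"
    print(find_user_unsafe(conn, malicious))  # returns every row in the table
    print(find_user_safe(conn, malicious))    # returns no rows
```

Both functions behave identically for ordinary names, which is why a flaw like this is easy to accept from an autocomplete suggestion. The difference only shows once the reviewer knows the input may come from an untrusted source.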

Some developers worry that AI is already picking up bad habits. “We have worked hard as an industry to get away from
copy-pasting solutions, and now Copilot has created a supercharged version of that,” says Maxim Khailo, a software
developer who has experimented with using AI to generate code but has not tried Copilot.

Khailo says it might be possible for hackers to mess with a program like Copilot. “If I was a bad actor, what I would do
would be to create vulnerable code projects on GitHub, artificially boost their popularity by buying GitHub stars on the
black market, and hope that it will become part of the corpus for the next training round.”

Both GitHub and OpenAI say that, on the contrary, their AI coding tools are only likely to become less error-prone.
OpenAI says it vets projects and code both manually and using automated tools.

De Moor at GitHub says recent updates to Copilot should have reduced the frequency of security vulnerabilities. But he
adds that his team is exploring other ways of improving the output of Copilot. One is to remove bad examples that the
underlying AI model learns from. Another may be to use reinforcement learning, an AI technique that has produced some
impressive results in games and other areas, to automatically spot bad output, including previously unseen examples.
“Enormous improvements are happening,” he says. “It’s almost unimaginable what it will look like in a year.”
