Goto

Collaborating Authors

 alphacode


Programming with AI: Evaluating ChatGPT, Gemini, AlphaCode, and GitHub Copilot for Programmers

Siam, Md Kamrul, Gu, Huanying, Cheng, Jerry Q.

arXiv.org Artificial Intelligence

Our everyday lives now heavily rely on artificial intelligence (AI) powered large language models (LLMs). Like regular users, programmers are also benefiting from the newest large language models. In response to the critical role that AI models play in modern software development, this study presents a thorough evaluation of leading programming assistants, including ChatGPT, Gemini(Bard AI), AlphaCode, and GitHub Copilot. The evaluation is based on tasks like natural language processing and code generation accuracy in different programming languages like Java, Python and C++. Based on the results, it has emphasized their strengths and weaknesses and the importance of further modifications to increase the reliability and accuracy of the latest popular models. Although these AI assistants illustrate a high level of progress in language understanding and code generation, along with ethical considerations and responsible usage, they provoke a necessity for discussion. With time, developing more refined AI technology is essential for achieving advanced solutions in various fields, especially with the knowledge of the feature intricacies of these models and their implications. This study offers a comparison of different LLMs and provides essential feedback on the rapidly changing area of AI models. It also emphasizes the need for ethical developmental practices to actualize AI models' full potential.


Natural Language Programming AIs are taking the drudgery out of coding

Engadget

That three-word pejorative is perpetually on the lips and at the fingertips of internet trolls and tech bros whenever media layoffs are announced. A useless sentiment in its own right, but with the recent advent of code generating AIs, knowing the ins and outs of a programming language like Python could soon be about as useful as knowing how to fluently speak a dead language like Sanskrit. In fact, these genAIs are already helping professional software developers code faster and more effectively by handling much of the programming grunt work. Two of today's most widely distributed and written coding languages are Java and Python. The former almost single handedly revolutionized cross-platform operation when it was released in the mid-'90s and now drives "everything from smartcards to space vehicles," according to Java Magazine in 2020 -- not to mention Wikipedia's search function and all of Minecraft.


Large Language Models Meet NL2Code: A Survey

Zan, Daoguang, Chen, Bei, Zhang, Fengji, Lu, Dianjie, Wu, Bingchao, Guan, Bei, Wang, Yongji, Lou, Jian-Guang

arXiv.org Artificial Intelligence

The task of generating code from a natural language description, or NL2Code, is considered a pressing and significant challenge in code intelligence. Thanks to the rapid development of pre-training techniques, surging large language models are being proposed for code, sparking the advances in NL2Code. To facilitate further research and applications in this field, in this paper, we present a comprehensive survey of 27 existing large language models for NL2Code, and also review benchmarks and metrics. We provide an intuitive comparison of all existing models on the HumanEval benchmark. Through in-depth observation and analysis, we provide some insights and conclude that the key factors contributing to the success of large language models for NL2Code are "Large Size, Premium Data, Expert Tuning". In addition, we discuss challenges and opportunities regarding the gap between models and humans. We also create a website https://nl2code.github.io to track the latest progress through crowd-sourcing. To the best of our knowledge, this is the first survey of large language models for NL2Code, and we believe it will contribute to the ongoing development of the field.


Updated Outlook of the AI Software Development Career Landscape

#artificialintelligence

AI technology is one of the fastest-growing industries in the world. One poll found that 35% of companies currently use AI and another 42% intend to use it in the future. Not only are AI software developer jobs ubiquitous, but they are also well paying. Programming salaries routinely rank among the top 10-15% in the United States. In this article, we take a look at what the AI software development landscape looks like in 2023.


Will ChatGPT Replace Developers? - DevOps.com

#artificialintelligence

AI is buzzing again thanks to the recent release of ChatGPT, a natural language chatbot that people are using to write emails, poems, song lyrics and college essays. Early adopters have even used it to write Python code, as well as to reverse engineer shellcode and rewrite it in C. ChatGPT has sparked hope among people eager for the arrival of practical applications of AI, but it also begs the question of whether it will displace writers and developers in the same way robots and computers have replaced some cashiers, assembly-line workers and, perhaps in the future, taxi drivers. It's hard to say how sophisticated the AI text-creation capabilities will be in the future as the technology ingests more and more examples of our online writing. But I see it having very limited capabilities for programming. If anything, it could end up being just another tool in the developer's kit to handle tasks that don't take the critical thinking skills software engineers bring to the table.


DeepMind Builds AI That Codes as Well as the Average Human Programmer - ExtremeTech

#artificialintelligence

While machine learning has advanced by leaps and bounds, it's hard to create an AI that's good at more than one thing. So, a machine could be trained with data to handle one class of programming challenges, but it would fail when given a different problem to tackle. So, the team decided to skip all the training on algorithms and code structure, instead treating it more like a translation problem. Programming challenges usually include a description of the task, and the resulting code submitted by a human participant is technically just an expression of the description. The AI works in two phases: It takes the description and converts it to an internal representation.


AlphaCode can solve complex problems and create code using AI

#artificialintelligence

A novel system called AlphaCode uses artificial intelligence (AI) to create computer code, and has recently participated in programming competitions, using critical thinking, algorithms, and natural language comprehension. The AI system performed extremely well in competitions. AlphaCode is an AI software system created by DeepMind, a subsidiary of the company Alphabet, the parent company of Google. The software generates code in Python or C, while filtering out any bad coding. It has the ability to generate code at an exceptional rate.


AI's next frontier: AlphaCode can match programming prowess of average coders

#artificialintelligence

Artificial intelligence software programs are becoming shockingly adept at carrying on conversations, winning board games and generating artwork -- but what about creating software programs? In a newly published paper, researchers at Google DeepMind say their AlphaCode program can keep up with the average human coder in standardized programming contests. "This result marks the first time an artificial intelligence system has performed competitively in programming contests," the researchers report in this week's issue of the journal Science. There's no need to sound the alarm about Skynet just yet: DeepMind's code-generating system earned an average ranking in the top 54.3% in simulated evaluations on recent programming competitions on the Codeforces platform -- which is a very "average" average. "Competitive programming is an extremely difficult challenge, and there's a massive gap between where we are now (solving around 30% of problems in 10 submissions) and top programmers (solving 90% of problems in a single submission)," DeepMind research scientist Yujia Li, one of the Science paper's principal authors, told GeekWire in an email.


Programming Is Hard -- Or at Least It Used to Be: Educational Opportunities And Challenges of AI Code Generation

Becker, Brett A., Denny, Paul, Finnie-Ansley, James, Luxton-Reilly, Andrew, Prather, James, Santos, Eddie Antonio

arXiv.org Artificial Intelligence

The introductory programming sequence has been the focus of much research in computing education. The recent advent of several viable and freely-available AI-driven code generation tools present several immediate opportunities and challenges in this domain. In this position paper we argue that the community needs to act quickly in deciding what possible opportunities can and should be leveraged and how, while also working on how to overcome or otherwise mitigate the possible challenges. Assuming that the effectiveness and proliferation of these tools will continue to progress rapidly, without quick, deliberate, and concerted efforts, educators will lose advantage in helping shape what opportunities come to be, and what challenges will endure. With this paper we aim to seed this discussion within the computing education community.


Will DeepMind's AlphaCode Replace Programmers? - KDnuggets

#artificialintelligence

The Alphabet subsidiary DeepMind has done it again, and this time, they are testing the boundaries of AI in software development sectors. DeepMind's AlphaCode was tested against human performance on coding challenges and achieved rank among the top 54% of human coders on Codeforces. This is a remarkable achievement as it is one of its kind. There are other code generation machine learning models, such as OpenAI Codex, but none of them tried to compete with human programmers. A coding challenge is like solving puzzles. To solve these challenges, an individual must have an understanding of logic, math, and programming skills.