Generative AI
Top 10 AI Innovations Of 2021 So Far
AI is a complex and ever-evolving field where organisations and individuals are constantly focused 0n finding novel solutions to pressing challenges. The year has been full of path-breaking innovations which have pushed the boundaries and made way for better outcomes. In this article, we list the top ten AI innovations of 2021 so far. OpenAI and Microsoft's GitHub Copilot is an AI-based tool for programmers to write better code. The programmer can describe a function to the Copilot in plain English as a comment, and the machine will convert it to actual code.
OpenAI's New Code Generator: GitHub Copilot (and Codex)
I explain Artificial Intelligence terms and news to non-experts. You've probably heard of the recent Copilot tool by GitHub, which generates code for you. You can see this tool as an auto-complete for code. You give it the name of a function along with some additional info, and it generates the code for you quite accurately! Rather, it will try to understand what you are trying to do to generate it.
GitHub Copilot And The Unfulfilled Promises Of An Artificial Intelligence Future
In late June of 2021, GitHub launched a'technical preview' of what they termed GitHub Copilot, described as an'AI pair programmer which helps you write better code'. Quite predictably, responses to this announcement varied from glee at the glorious arrival of our code-generating AI overlords, to dismay and predictions of doom and gloom as before long companies would be firing software developers en-masse. As is usually the case with such controversial topics, neither of these extremes are even remotely close to the truth. In fact, the OpenAI Codex machine learning model which underlies GitHub's Copilot is derived from OpenAI's GPT-3 natural language model, and features many of the same stumbles and gaffes which GTP-3 has. So if Codex and with it Copilot isn't everything it's cracked up to be, what is the big deal, and why show it at all?
The Devil is in the GAN: Defending Deep Generative Models Against Backdoor Attacks
Rawat, Ambrish, Levacher, Killian, Sinn, Mathieu
Deep Generative Models (DGMs) allow users to synthesize data from complex, high-dimensional manifolds. Industry applications of DGMs include data augmentation to boost performance of (semi-)supervised machine learning, or to mitigate fairness or privacy concerns. Large-scale DGMs are notoriously hard to train, requiring expert skills, large amounts of data and extensive computational resources. Thus, it can be expected that many enterprises will resort to sourcing pre-trained DGMs from potentially unverified third parties, e.g.~open source model repositories. As we show in this paper, such a deployment scenario poses a new attack surface, which allows adversaries to potentially undermine the integrity of entire machine learning development pipelines in a victim organization. Specifically, we describe novel training-time attacks resulting in corrupted DGMs that synthesize regular data under normal operations and designated target outputs for inputs sampled from a trigger distribution. Depending on the control that the adversary has over the random number generation, this imposes various degrees of risk that harmful data may enter the machine learning development pipelines, potentially causing material or reputational damage to the victim organization. Our attacks are based on adversarial loss functions that combine the dual objectives of attack stealth and fidelity. We show its effectiveness for a variety of DGM architectures (Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs)) and data domains (images, audio). Our experiments show that - even for large-scale industry-grade DGMs - our attack can be mounted with only modest computational efforts. We also investigate the effectiveness of different defensive approaches (based on static/dynamic model and output inspections) and prescribe a practical defense strategy that paves the way for safe usage of DGMs.
AI-Memer: Using Machine Learning to Create Funny Memes
The user starts by entering a search query to find a background image, like "apple pie". The system then checks for matching images in Wikimedia Commons [1] and the OpenImages dataset [2]. Both datasets have corresponding text descriptions of the images. I use the CLIP [3] encoders from OpenAI to first perform a semantic search on the text descriptions. A semantic search looks for matching concepts, not just a word search.
Plans for Future Maintenance of Gym ยท Issue #2259 ยท openai/gym
Fixes to code style (use the same style tests as either PettingZoo does or SB3 does and that to CI tests once they're properly functioning) (Thanks @cclauss!) Removal of old and entirely unused code Bug fixes (they'll actually be merged now!) Useful non-breaking extensions to or entirely new action/observation spaces Built in API compliance testing (Similar to what PettingZoo has for environments and what SB3 added for Gym environments) Nonbreaking and useful additions of environment arguments, similar to what most third party Gym environments now have or what PettingZoo environments generally have by default (e.g. Lycon is a Python library that's just took the C image resizing logic from OpenCV and put it in it's own repo. This makes it run slightly faster, and more importantly it gets rid of all the horrifying installation issues associated with OpenCV (and most RL libraries only depended on OpenCV for this functionality). However, Lycon is no longer maintained and does not generate wheels with the C already compiled (though Ben Black added the logic for this- ethereon/lycon#25). Dealing with all flavors of MuJuCo problems (I am objectively not qualified for this) Create a new, in depth, documentation website.
An endlessly changing playground teaches AIs how to multitask
They advance to more complex multiplayer games like hide and seek or capture the flag, where teams compete to be the first to find and grab their opponent's flag. The playground manager has no specific goal but aims to improve the general capability of the players over time. AIs like DeepMind's AlphaZero have beaten the world's best human players at chess and Go. But they can only learn one game at a time. As DeepMind cofounder Shane Legg put it when I spoke to him last year, it's like having to swap out your chess brain for your Go brain each time you want to switch games.
OpenAI releases Triton, a programming language for AI workload optimization
All the sessions from Transform 2021 are available on-demand now. OpenAI today released Triton, an open source, Python-like programming language that enables researchers to write highly efficient GPU code for AI workloads. Triton makes it possible to reach peak hardware performance with relatively little effort, OpenAI claims, producing code on par with what an expert could achieve in as few as 25 lines. Deep neural networks have emerged as an important type of AI model, capable of achieving state-of-the-art performance across natural language processing, computer vision, and other domains. The strength of these models lies in their hierarchical structure, which generates a large amount of highly parallelizable work well-suited for multicore hardware like GPUs.
Vision Transformers: Natural Language Processing (NLP) Increases Efficiency and Model Generality
There has been no shortage of developments vying for a share of your attention over the last year or so. However, if you regularly follow the state of machine learning research you may recall a loud contender for a share of your mind in OpenAI's GPT-3 and accompanying business strategy development from the group. GPT-3 is the latest and by far the largest in OpenAI's general purpose transformer lineage working on models for natural language processing. Of course, GPT-3 and GPTs may grab headlines, but it belongs to a much larger superfamily of transformer models, including a plethora of variants based on the Bidirectional Encoder Representations from Transformers (BERT) family originally created by Google, as well as other smaller families of models from Facebook and Microsoft. For an expansive but still not exhaustive overview of major NLP transformers, the leading resource is probably the Apache 2.0 licensed Hugging Face () library.