Generative AI
AI Can Do Great Things--if It Doesn't Burn the Planet
Last month, researchers at OpenAI in San Francisco revealed an algorithm capable of learning, through trial and error, how to manipulate the pieces of a Rubik's Cube using a robotic hand. It was a remarkable research feat, but it required more than 1,000 desktop computers plus a dozen machines running specialized graphics chips crunching intensive calculations for several months. The effort may have consumed about 2.8 gigawatt-hours of electricity, estimates Evan Sparks, CEO of Determined AI, a startup that provides software to help companies manage AI projects. A spokesperson for OpenAI questioned the calculation, noting that it makes several assumptions. But OpenAI declined to disclose further details of the project or offer an estimate of the electricity it consumed.
Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative Models
Shi, Yuge, N, Siddharth, Paige, Brooks, Torr, Philip
Learning generative models that span multiple data modalities, such as vision and language, is often motivated by the desire to learn more useful, generalisable representations that faithfully capture common underlying factors between the modalities. In this work, we characterise successful learning of such models as the fulfilment of four criteria: i) implicit latent decomposition into shared and private subspaces, ii) coherent joint generation over all modalities, iii) coherent cross-generation across individual modalities, and iv) improved model learning for individual modalities through multi-modal integration. Here, we propose a mixture-of-experts multi-modal variational autoencoder (MMVAE) for learning of generative models on different sets of modalities, including a challenging image - language dataset, and demonstrate its ability to satisfy all four criteria, both qualitatively and quantitatively. Papers published at the Neural Information Processing Systems Conference.
Reward Engineering for Object Pick and Place Training
Nagpal, Raghav, Krishnan, Achyuthan Unni, Yu, Hanshen
Robotic grasping is a crucial area of research as it can result in the acceleration of the automation of several Industries utilizing robots ranging from manufacturing to healthcare. Reinforcement learning is the field of study where an agent learns a policy to execute an action by exploring and exploiting rewards from an environment. Reinforcement learning can thus be used by the agent to learn how to execute a certain task, in our case grasping an object. We have used the Pick and Place environment provided by OpenAI's Gym to engineer rewards. Hindsight Experience Replay (HER) has shown promising results with problems having a sparse reward. In the default configuration of the OpenAI baseline and environment the reward function is calculated using the distance between the target location and the robot end-effector. By weighting the cost based on the distance of the end-effector from the goal in the x,y and z-axes we were able to almost halve the learning time compared to the baselines provided by OpenAI, an intuitive strategy that further reduced learning time. In this project, we were also able to introduce certain user desired trajectories in the learnt policies (city-block / Manhattan trajectories). This helps us understand that by engineering the rewards we can tune the agent to learn policies in a certain way even if it might not be the most optimal but is the desired manner.
New Projects See GPT-2 Summarizing Movies, Playing Chess
New Netflix and Chess applications have once again illustrated the range and potential of OpenAI's GPT-2 language model. GPT-2 became an overnight sensation when OpenAI released it in February 2019, receiving critical acclaim worldwide. GPT-2 is a successor to GPT, a large transformer-based language model with 1.5 million parameters and trained on 8 million web pages. GPT-2 can automatically generate coherent paragraphs of text, basically predicting the next word given previous words. One of the big challenges for language models is domain-specific datasets that involve a lot of expert knowledge. That's GPT-2's charm point -- it seems free of such constraints, outperforming other language models on specific domains like Wikipedia, news, or books without using any domain-specific training datasets.
Granular Learning with Deep Generative Models using Highly Contaminated Data
An approach to utilize recent advances in deep generative models for anomaly detection in a granular (continuous) sense on a real-world image dataset with quality issues is detailed using recent normalizing flow models, with implications in many other applications/domains/data types. The approach is completely unsupervised (no annotations available) but qualitatively shown to provide accurate semantic labeling for images via heatmaps of the scaled log-likelihood overlaid on the images. When sorted based on the median values per image, clear trends in quality are observed. Furthermore, downstream classification is shown to be possible and effective via a weakly supervised approach using the log-likelihood output from a normalizing flow model as a training signal for a feature-extracting convolutional neural network. The pre-linear dense layer outputs on the CNN are shown to disentangle high level representations and efficiently cluster various quality issues. Thus, an entirely non-annotated (fully unsupervised) approach is shown possible for accurate estimation and classification of quality issues..
Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative Models
Shi, Yuge, N, Siddharth, Paige, Brooks, Torr, Philip
Learning generative models that span multiple data modalities, such as vision and language, is often motivated by the desire to learn more useful, generalisable representations that faithfully capture common underlying factors between the modalities. In this work, we characterise successful learning of such models as the fulfilment of four criteria: i) implicit latent decomposition into shared and private subspaces, ii) coherent joint generation over all modalities, iii) coherent cross-generation across individual modalities, and iv) improved model learning for individual modalities through multi-modal integration. Here, we propose a mixture-of-experts multimodal variational autoencoder (MMVAE) to learn generative models on different sets of modalities, including a challenging image language dataset, and demonstrate its ability to satisfy all four criteria, both qualitatively and quantitatively. Code, data, and models are provided at this url.
This Browser Extension 'GPTrue or False' Can Identify AI Written Content MarkTechPost
Recently OpenAI announced the launch of its 1.5 billion parameter language model GPT-2. GPT-2 has been in the news as the scary AI text generator with potential threats regarding fake news stories, and so on. But now we have'GPTrue or False' browser extension that displays the GPT-2 Log Probability of selected portions of text. This browser extension allows you to select text on a website and finds out what you selected is written using OpenAI's GPT-2 A.I. model. GPTrue or False is available both for Chrome and Firefox.
OpenAI Benchmarks Reinforcement Learning To Avoid Model Overfitting
OpenAI has benchmarked reinforcement learning by mitigating most of its problems using the procedural generational technique. RL has been a central methodology in the field of artificial intelligence. However, over the years, researchers have witnessed a few shortcomings with the approach. Developers often use a colossal amount of data to train and increase the efficiency of machine learning models. But this has resulted in overfitting of data in many cases, thereby, causing hindrance in the adoption of ML technologies.
OpenAI Open Sources Safety Gym to Improve Safety in Reinforcement Learning Agents
Safety is one of the emerging concerns in deep learning systems. In the context of deep learning systems, safety is related to building agents that respect safety dynamics in a given environment. In many cases such as supervised learning, safety is modeled as part of the training datasets. However, other methods such as reinforcement learning require agents to master the dynamics of the environments by experimenting with it which introduces its own set of safety concerns. To address some of these challenges, OpenAI has recently open sourced Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safety constraints while training.
Roberto G.E. Martín on LinkedIn: #AI #RL
On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game. The game of Dota 2 presents novel challenges for AI systems such as long-time horizons, imperfect information, and complex, continuous state-action spaces, all challenges which will become increasingly central to more capable AI systems.