Generative AI
A Deep Generative Model of Speech Complex Spectrograms
Nugraha, Aditya Arie, Sekiguchi, Kouhei, Yoshii, Kazuyoshi
This paper proposes an approach to the joint modeling of the short-time Fourier transform magnitude and phase spectrograms with a deep generative model. We assume that the magnitude follows a Gaussian distribution and the phase follows a von Mises distribution. To improve the consistency of the phase values in the time-frequency domain, we also apply the von Mises distribution to the phase derivatives, i.e., the group delay and the instantaneous frequency. Based on these assumptions, we explore and compare several combinations of loss functions for training our models. Built upon the variational autoencoder framework, our model consists of three convolutional neural networks acting as an encoder, a magnitude decoder, and a phase decoder. In addition to the latent variables, we propose to also condition the phase estimation on the estimated magnitude. Evaluated for a time-domain speech reconstruction task, our models could generate speech with a high perceptual quality and a high intelligibility.
Neural MMO - A Massively Multiagent Game Environment
Our platform supports a large, variable number of agents within a persistent and open-ended task. The inclusion of many agents and species leads to better exploration, divergent niche formation, and greater overall competence. In recent years, multiagent settings have become an effective platform for deep reinforcement learning research. Despite this progress, there are still two main challenges for multiagent reinforcement learning. We need to create open-ended tasks with a high complexity ceiling: current environments are either complex but too narrow or open-ended but too simple.
OpenAI launches Neural MMO, a massive reinforcement learning simulator
Artificial intelligence that's beastly at World of Warcraft might not lie too far into the distant future, if OpenAI has its way. The San Francisco research nonprofit today released Neural MMO, a "massively multiagent" virtual training ground that plops agents in the middle of an RPG-like world -- one complete with a resource collection mechanic and player versus player combat. "The game genre of Massively Multiplayer Online Games (MMOs) simulates a large ecosystem of a variable number of players competing in persistent and extensive environments," OpenAI wrote in a blog post. "The inclusion of many agents and species leads to better exploration, divergent niche formation, and greater overall competence." AI agents spawn randomly in Neural MMO environments, which contain automatically generated tile maps of a prespecified size. Some tiles are traversable, like "forest" (which bears food) and "grass," while others aren't (such as water and stone).
Self-adversarial Variational Autoencoder with Gaussian Anomaly Prior Distribution for Anomaly Detection
Wang, Xuhong, Du, Ying, Lin, Shijie, Cui, Ping, Yang, Yupu
Recently, deep generative models have become increasingly popular in unsupervised anomaly detection. However, deep generative models aim at recovering the data distribution rather than detecting anomalies. Besides, deep generative models have the risk of overfitting training samples, which has disastrous effects on anomaly detection performance. To solve the above two problems, we propose a Self-adversarial Variational Autoencoder with a Gaussian anomaly prior assumption. We assume that both the anomalous and the normal prior distribution are Gaussian and have overlaps in the latent space. Therefore, a Gaussian transformer net T is trained to synthesize anomalous but near-normal latent variables. Keeping the original training objective of Variational Autoencoder, besides, the generator G tries to distinguish between the normal latent variables and the anomalous ones synthesized by T, and the encoder E is trained to discriminate whether the output of G is real. These new objectives we added not only give both G and E the ability to discriminate but also introduce additional regularization to prevent overfitting. Compared with the SOTA baselines, the proposed model achieves significant improvements in extensive experiments. Datasets and our model are available at a Github repository.
Reinforcement Learning with Pytorch
Learn to apply Reinforcement Learning and Artificial Intelligence algorithms using Python, Pytorch and OpenAI Gym Artificial Intelligence is dynamically edging its way into our lives. It is already broadly available and we use it - sometimes even not knowing it - on daily basis. Soon it will be our permanent, every day companion. And where can we place Reinforcement Learning in AI world? Definitely this is one of the most promising and fastest growing technologies that can eventually lead us to General Artificial Intelligence!
OpenAI or ClosedAI? That Is A Question.
Better Language Models and Their Implications "Our model, called GPT-2 (a successor to GPT), was trained simply to predict the next word in 40GB of Internet text. Due to our concerns about malicious applications of the technology, we are not releasing the trained model. As an experiment in responsible disclosure, we are instead releasing a much smaller model for researchers to experiment with, as well as a technical paper." AI Researchers Debate the Ethics of Sharing Potentially Harmful Programs The debate has been wide-ranging and sometimes contentious. It even turned into a bit of a meme among AI researchers, who joked that they've had an amazing breakthrough in the lab, but the results were too dangerous to share at the moment.
Twenty minutes into the future with OpenAI's Deep Fake Text AI
In 1985, the TV film Max Headroom: 20 Minutes into the Future presented a science fictional cyberpunk world where an evil media company tried to create an artificial intelligence based on a reporter's brain to generate content to fill airtime. There were somewhat unintended results. Replace "reporter" with "redditors," "evil media company" with "well meaning artificial intelligence researchers," and "airtime" with "a very concerned blog post," and you've got what Ars reported about last week: Generative Pre-trained Transformer-2 (GPT-2), a Franken-creation from researchers at the non-profit research organization OpenAI. Unlike some earlier text-generation systems based on a statistical analysis of text (like those using Markov chains), GPT-2 is a text-generating bot based on a model with 1.5 billion parameters. With or without guidance, GPT-2 can create blocks of text that look like they were written by humans.
OpenAI's new multitalented AI writes, translates, and slanders
OpenAI's researchers knew they were on to something when their language modeling program wrote a convincing essay on a topic they disagreed with. They'd been testing the new AI system by feeding it text prompts, getting it to complete made-up sentences and paragraphs. Then, says David Luan, VP of engineering at the Californian lab, they had the idea of asking it to argue a point they thought was counterintuitive. In this case: why recycling is bad for the world. "And it wrote this really competent, really well-reasoned essay," Luan tells The Verge.
Improving Missing Data Imputation with Deep Generative Models
Camino, Ramiro D., Hammerschmidt, Christian A., State, Radu
Datasets with missing values are very common on industry applications, and they can have a negative impact on machine learning models. Recent studies introduced solutions to the problem of imputing missing values based on deep generative models. Previous experiments with Generative Adversarial Networks and Variational Autoencoders showed interesting results in this domain, but it is not clear which method is preferable for different use cases. The goal of this work is twofold: we present a comparison between missing data imputation solutions based on deep generative models, and we propose improvements over those methodologies. We run our experiments using known real life datasets with different characteristics, removing values at random and reconstructing them with several imputation techniques. Our results show that the presence or absence of categorical variables can alter the selection of the best model, and that some models are more stable than others after similar runs with different random number generator seeds.
OpenAI Text Generator
OpenAI has the entire AI community debating its decision to not release the fully trained version of its powerful new text generator model dubbed GPT-2. I'm going to explain how GPT-2 works using code, math, and animations. We'll discuss its potential applications (both good and bad), ways of preventing misuse, and at the end of the video I'll give my take on whether OpenAI was justified in doing so. The transformer architecture is quickly replacing recurrent networks for sequence learning, and OpenAI's GPT-2 is the latest example of using it at scale. Thats what keeps me going.