Generative AI
A Factorial Mixture Prior for Compositional Deep Generative Models
Paquet, Ulrich, Ghaisas, Sumedh K., Tieleman, Olivier
We assume that a high-dimensional datum, like an image, is a compositional expression of a set of properties, with a complicated non-linear relationship between the datum and its properties. This paper proposes a factorial mixture prior for capturing latent properties, thereby adding structured compositionality to deep generative models. The prior treats a latent vector as belonging to Cartesian product of subspaces, each of which is quantized separately with a Gaussian mixture model. Some mixture components can be set to represent properties as observed random variables whenever labeled properties are present. Through a combination of stochastic variational inference and gradient descent, a method for learning how to infer discrete properties in an unsupervised or semi-supervised way is outlined and empirically evaluated.
Physics-informed deep generative models
We consider the application of deep generative models in propagating uncertainty through complex physical systems. Specifically, we put forth an implicit variational inference formulation that constrains the generative model output to satisfy given physical laws expressed by partial differential equations. Such physics-informed constraints provide a regularization mechanism for effectively training deep probabilistic models for modeling physical systems in which the cost of data acquisition is high and training data-sets are typically small. This provides a scalable framework for characterizing uncertainty in the outputs of physical systems due to randomness in their inputs or noise in their observations. We demonstrate the effectiveness of our approach through a canonical example in transport dynamics.
Counterfactuals uncover the modular structure of deep generative models
Besserve, Michel, Sun, Rรฉmy, Schรถlkopf, Bernhard
Deep generative models such as Generative Adversarial Networks (GANs) and Variational Auto-Encoders (VAEs) are important tools to capture and investigate the properties of complex empirical data. However, the complexity of their inner elements makes their functioning challenging to assess and modify. In this respect, these architectures behave as black box models. In order to better understand the function of such networks, we analyze their modularity based on the counterfactual manipulation of their internal variables. Experiments with face images support that modularity between groups of channels is achieved to some degree within convolutional layers of vanilla VAE and GAN generators. This helps understand the functional organization of these systems and allows designing meaningful transformations of the generated images without further training.
Deep Variational Transfer: Transfer Learning through Semi-supervised Deep Generative Models
Belhaj, Marouan, Protopapas, Pavlos, Pan, Weiwei
In real-world applications, it is often expensive and time-consuming to obtain labeled examples. In such cases, knowledge transfer from related domains, where labels are abundant, could greatly reduce the need for extensive labeling efforts. In this scenario, transfer learning comes in hand. In this paper, we propose Deep Variational Transfer (DVT), a variational autoencoder that transfers knowledge across domains using a shared latent Gaussian mixture model. Thanks to the combination of a semi-supervised ELBO and parameters sharing across domains, we are able to simultaneously: (i) align all supervised examples of the same class into the same latent Gaussian Mixture component, independently from their domain; (ii) predict the class of unsupervised examples from different domains and use them to better model the occurring shifts. We perform tests on MNIST and USPS digits datasets, showing DVT's ability to perform transfer learning across heterogeneous datasets. Additionally, we present DVT's top classification performances on the MNIST semi-supervised learning challenge. We further validate DVT on a astronomical datasets. DVT achieves states-of-the-art classification performances, transferring knowledge across real stars surveys datasets, EROS, MACHO and HiTS, . In the worst performance, we double the achieved F1-score for rare classes. These experiments show DVT's ability to tackle all major challenges posed by transfer learning: different covariate distributions, different and highly imbalanced class distributions and different feature spaces.
Relative Entropy Regularized Policy Iteration
Abdolmaleki, Abbas, Springenberg, Jost Tobias, Degrave, Jonas, Bohez, Steven, Tassa, Yuval, Belov, Dan, Heess, Nicolas, Riedmiller, Martin
We present an off-policy actor-critic algorithm for Reinforcement Learning (RL) that combines ideas from gradient-free optimization via stochastic search with learned action-value function. The result is a simple procedure consisting of three steps: i) policy evaluation by estimating a parametric action-value function; ii) policy improvement via the estimation of a local non-parametric policy; and iii) generalization by fitting a parametric policy. Each step can be implemented in different ways, giving rise to several algorithm variants. Our algorithm draws on connections to existing literature on black-box optimization and 'RL as an inference' and it can be seen either as an extension of the Maximum a Posteriori Policy Optimisation algorithm (MPO) [Abdolmaleki et al., 2018a], or as an extension of Trust Region Covariance Matrix Adaptation Evolutionary Strategy (CMA-ES) [Abdolmaleki et al., 2017b; Hansen et al., 1997] to a policy iteration scheme. Our comparison on 31 continuous control tasks from parkour suite [Heess et al., 2017], DeepMind control suite [Tassa et al., 2018] and OpenAI Gym [Brockman et al., 2016] with diverse properties, limited amount of compute and a single set of hyperparameters, demonstrate the effectiveness of our method and the state of art results. Videos, summarizing results, can be found at goo.gl/HtvJKR .
Void Filling of Digital Elevation Models with Deep Generative Models
Gavriil, Konstantinos, Muntingh, Georg, Barrowclough, Oliver J. D.
In recent years, advances in machine learning algorithms, cheap computational resources, and the availability of big data have spurred the deep learning revolution in various application domains. In particular, supervised learning techniques in image analysis have led to superhuman performance in various tasks, such as classification, localization, and segmentation, while unsupervised learning techniques based on increasingly advanced generative models have been applied to generate high-resolution synthetic images indistinguishable from real images. In this paper we consider a state-of-the-art machine learning model for image inpainting, namely a Wasserstein Generative Adversarial Network based on a fully convolutional architecture with a contextual attention mechanism. We show that this model can successfully be transferred to the setting of digital elevation models (DEMs) for the purpose of generating semantically plausible data for filling voids. Training, testing and experimentation is done on GeoTIFF data from various regions in Norway, made openly available by the Norwegian Mapping Authority.
Teaching AI to read is harder than it seems
Researchers have shown that rapidly improving AI techniques can facilitate the creation of fake images that look real. As these kinds of technologies move into the language field as well, Howard says, we may need to be more skeptical than ever about what we encounter online. These new language systems learn by analysing millions of sentences written by humans. A system built by OpenAI, a lab based in San Francisco, analysed thousands of self-published books, including romance novels, science fiction and more. Each system learned a particular skill by analysing all that text.
OpenAI and DeepMind AI system achieves 'superhuman' performance in Pong and Enduro
Machines learning to play games by watching humans might sound like the plot of a science fiction novel, but that's exactly what researchers at OpenAI -- a nonprofit, San Francisco-based AI research company backed by Elon Musk, Reid Hoffman, and Peter Thiel, among other tech luminaries -- and Google subsidiary DeepMind claim to have accomplished. In a paper published on the preprint server Arxiv.org Their deep neural network -- which, like other neural networks, consists of mathematical functions loosely modeled on neurons in the brain -- achieved superhuman performance on two out of the nine Atari games tested (Pong and Enduro) and beat baseline models in seven. The research was submitted to the Neural Information Processing Systems (NIPS 2018), which is scheduled to take place in Montreal, Canada during the first week in December. "To solve complex real-world problems with reinforcement learning, we cannot rely on manually specified reward functions," the team wrote.
OpenAI launches reinforcement learning training to prepare for artificial general intelligence
OpenAI today announced the launch of Spinning Up, a program designed to teach anyone deep reinforcement learning. OpenAI is well known for making funky-looking agents in virtual environments that learn how to walk on their own such as Humanoid v2 or POLO, a collaboration with University of Washington. Reinforcement learning involves providing reward signals to an agent in an environment incentivized to maximize its reward to meet a goal. RL has played a role in major AI breakthroughs such as Google DeepMind's AlphaGo and agents trained in environments like Dota 2. Spinning Up includes a collection of important reinforcement learning research papers, a glossary of terminology necessary to understand RL, and a collection of algorithms for running exercises. The program is being launched not just to help people learn how reinforcement learning works, but to make progress towards OpenAI's general goal of safely creating artificial general intelligence (AGI) by involving more people from fields beyond computer science. "Solving AI safety will require people with a wide range of expertise and perspectives, and many relevant professions have no connection to engineering or computer science at all.
Bias and Generalization in Deep Generative Models: An Empirical Study
Zhao, Shengjia, Ren, Hongyu, Yuan, Arianna, Song, Jiaming, Goodman, Noah, Ermon, Stefano
In high dimensional settings, density estimation algorithms rely crucially on their inductive bias. Despite recent empirical success, the inductive bias of deep generative models is not well understood. In this paper we propose a framework to systematically investigate bias and generalization in deep generative models of images. Inspired by experimental methods from cognitive psychology, we probe each learning algorithm with carefully designed training datasets to characterize when and how existing models generate novel attributes and their combinations. We identify similarities to human psychology and verify that these patterns are consistent across commonly used models and architectures.