AITopics | Generative AI

Collaborating Authors

Generative AI

News Overviews Instructional Materials AI-Alerts Classics

A Factorial Mixture Prior for Compositional Deep Generative Models

Paquet, Ulrich, Ghaisas, Sumedh K., Tieleman, Olivier

arXiv.org Artificial IntelligenceDec-18-2018

We assume that a high-dimensional datum, like an image, is a compositional expression of a set of properties, with a complicated non-linear relationship between the datum and its properties. This paper proposes a factorial mixture prior for capturing latent properties, thereby adding structured compositionality to deep generative models. The prior treats a latent vector as belonging to Cartesian product of subspaces, each of which is quantized separately with a Gaussian mixture model. Some mixture components can be set to represent properties as observed random variables whenever labeled properties are present. Through a combination of stochastic variational inference and gradient descent, a method for learning how to infer discrete properties in an unsupervised or semi-supervised way is outlined and empirically evaluated.

artificial intelligence, ikd, machine learning, (16 more...)

arXiv.org Artificial Intelligence

1812.0748

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.62)

Add feedback

Physics-informed deep generative models

Yang, Yibo, Perdikaris, Paris

arXiv.org Machine LearningDec-9-2018

We consider the application of deep generative models in propagating uncertainty through complex physical systems. Specifically, we put forth an implicit variational inference formulation that constrains the generative model output to satisfy given physical laws expressed by partial differential equations. Such physics-informed constraints provide a regularization mechanism for effectively training deep probabilistic models for modeling physical systems in which the cost of data acquisition is high and training data-sets are typically small. This provides a scalable framework for characterizing uncertainty in the outputs of physical systems due to randomness in their inputs or noise in their observations. We demonstrate the effectiveness of our approach through a canonical example in transport dynamics.

artificial intelligence, machine learning, neural network, (16 more...)

arXiv.org Machine Learning

1812.03511

Country: North America > United States > Pennsylvania (0.28)

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.61)

Add feedback

Counterfactuals uncover the modular structure of deep generative models

Besserve, Michel, Sun, Rémy, Schölkopf, Bernhard

arXiv.org Machine LearningDec-7-2018

Deep generative models such as Generative Adversarial Networks (GANs) and Variational Auto-Encoders (VAEs) are important tools to capture and investigate the properties of complex empirical data. However, the complexity of their inner elements makes their functioning challenging to assess and modify. In this respect, these architectures behave as black box models. In order to better understand the function of such networks, we analyze their modularity based on the counterfactual manipulation of their internal variables. Experiments with face images support that modularity between groups of channels is achieved to some degree within convolutional layers of vanilla VAE and GAN generators. This helps understand the functional organization of these systems and allows designing meaningful transformations of the generated images without further training.

artificial intelligence, generative model, machine learning, (20 more...)

arXiv.org Machine Learning

1812.03253

Country: Europe (0.46)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.61)

Add feedback

Deep Variational Transfer: Transfer Learning through Semi-supervised Deep Generative Models

Belhaj, Marouan, Protopapas, Pavlos, Pan, Weiwei

arXiv.org Machine LearningDec-7-2018

In real-world applications, it is often expensive and time-consuming to obtain labeled examples. In such cases, knowledge transfer from related domains, where labels are abundant, could greatly reduce the need for extensive labeling efforts. In this scenario, transfer learning comes in hand. In this paper, we propose Deep Variational Transfer (DVT), a variational autoencoder that transfers knowledge across domains using a shared latent Gaussian mixture model. Thanks to the combination of a semi-supervised ELBO and parameters sharing across domains, we are able to simultaneously: (i) align all supervised examples of the same class into the same latent Gaussian Mixture component, independently from their domain; (ii) predict the class of unsupervised examples from different domains and use them to better model the occurring shifts. We perform tests on MNIST and USPS digits datasets, showing DVT's ability to perform transfer learning across heterogeneous datasets. Additionally, we present DVT's top classification performances on the MNIST semi-supervised learning challenge. We further validate DVT on a astronomical datasets. DVT achieves states-of-the-art classification performances, transferring knowledge across real stars surveys datasets, EROS, MACHO and HiTS, . In the worst performance, we double the achieved F1-score for rare classes. These experiments show DVT's ability to tackle all major challenges posed by transfer learning: different covariate distributions, different and highly imbalanced class distributions and different feature spaces.

artificial intelligence, deep learning, machine learning, (15 more...)

arXiv.org Machine Learning

1812.03123

Country: North America > United States (0.49)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.41)

Add feedback

Relative Entropy Regularized Policy Iteration

Abdolmaleki, Abbas, Springenberg, Jost Tobias, Degrave, Jonas, Bohez, Steven, Tassa, Yuval, Belov, Dan, Heess, Nicolas, Riedmiller, Martin

arXiv.org Machine LearningDec-5-2018

We present an off-policy actor-critic algorithm for Reinforcement Learning (RL) that combines ideas from gradient-free optimization via stochastic search with learned action-value function. The result is a simple procedure consisting of three steps: i) policy evaluation by estimating a parametric action-value function; ii) policy improvement via the estimation of a local non-parametric policy; and iii) generalization by fitting a parametric policy. Each step can be implemented in different ways, giving rise to several algorithm variants. Our algorithm draws on connections to existing literature on black-box optimization and 'RL as an inference' and it can be seen either as an extension of the Maximum a Posteriori Policy Optimisation algorithm (MPO) [Abdolmaleki et al., 2018a], or as an extension of Trust Region Covariance Matrix Adaptation Evolutionary Strategy (CMA-ES) [Abdolmaleki et al., 2017b; Hansen et al., 1997] to a policy iteration scheme. Our comparison on 31 continuous control tasks from parkour suite [Heess et al., 2017], DeepMind control suite [Tassa et al., 2018] and OpenAI Gym [Brockman et al., 2016] with diverse properties, limited amount of compute and a single set of hyperparameters, demonstrate the effectiveness of our method and the state of art results. Videos, summarizing results, can be found at goo.gl/HtvJKR .

artificial intelligence, machine learning, reinforcement learning, (20 more...)

arXiv.org Machine Learning

1812.02256

Genre: Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment (0.46)
Transportation (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

Void Filling of Digital Elevation Models with Deep Generative Models

Gavriil, Konstantinos, Muntingh, Georg, Barrowclough, Oliver J. D.

arXiv.org Artificial IntelligenceNov-30-2018

In recent years, advances in machine learning algorithms, cheap computational resources, and the availability of big data have spurred the deep learning revolution in various application domains. In particular, supervised learning techniques in image analysis have led to superhuman performance in various tasks, such as classification, localization, and segmentation, while unsupervised learning techniques based on increasingly advanced generative models have been applied to generate high-resolution synthetic images indistinguishable from real images. In this paper we consider a state-of-the-art machine learning model for image inpainting, namely a Wasserstein Generative Adversarial Network based on a fully convolutional architecture with a contextual attention mechanism. We show that this model can successfully be transferred to the setting of digital elevation models (DEMs) for the purpose of generating semantically plausible data for filling voids. Training, testing and experimentation is done on GeoTIFF data from various regions in Norway, made openly available by the Norwegian Mapping Authority.

artificial intelligence, deep learning, machine learning, (14 more...)

arXiv.org Artificial Intelligence

1811.12693

Country:

Europe > Norway (0.50)
Europe > Austria > Vienna (0.14)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.41)

Add feedback

Teaching AI to read is harder than it seems

#artificialintelligenceNov-27-2018, 10:18:45 GMT

Researchers have shown that rapidly improving AI techniques can facilitate the creation of fake images that look real. As these kinds of technologies move into the language field as well, Howard says, we may need to be more skeptical than ever about what we encounter online. These new language systems learn by analysing millions of sentences written by humans. A system built by OpenAI, a lab based in San Francisco, analysed thousands of self-published books, including romance novels, science fiction and more. Each system learned a particular skill by analysing all that text.

large language model, machine learning, natural language, (6 more...)

#artificialintelligence

Country: North America > United States > California > San Francisco County > San Francisco (0.29)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

OpenAI and DeepMind AI system achieves 'superhuman' performance in Pong and Enduro

#artificialintelligenceNov-17-2018, 04:22:30 GMT

Machines learning to play games by watching humans might sound like the plot of a science fiction novel, but that's exactly what researchers at OpenAI -- a nonprofit, San Francisco-based AI research company backed by Elon Musk, Reid Hoffman, and Peter Thiel, among other tech luminaries -- and Google subsidiary DeepMind claim to have accomplished. In a paper published on the preprint server Arxiv.org Their deep neural network -- which, like other neural networks, consists of mathematical functions loosely modeled on neurons in the brain -- achieved superhuman performance on two out of the nine Atari games tested (Pong and Enduro) and beat baseline models in seven. The research was submitted to the Neural Information Processing Systems (NIPS 2018), which is scheduled to take place in Montreal, Canada during the first week in December. "To solve complex real-world problems with reinforcement learning, we cannot rely on manually specified reward functions," the team wrote.

artificial intelligence, machine learning, pong and enduro, (17 more...)

#artificialintelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.25)
North America > Canada > Quebec > Montreal (0.25)

Genre: Research Report (0.56)

Industry: Leisure & Entertainment > Games > Computer Games (0.74)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.64)

Add feedback

OpenAI launches reinforcement learning training to prepare for artificial general intelligence

#artificialintelligenceNov-10-2018, 12:27:57 GMT

OpenAI today announced the launch of Spinning Up, a program designed to teach anyone deep reinforcement learning. OpenAI is well known for making funky-looking agents in virtual environments that learn how to walk on their own such as Humanoid v2 or POLO, a collaboration with University of Washington. Reinforcement learning involves providing reward signals to an agent in an environment incentivized to maximize its reward to meet a goal. RL has played a role in major AI breakthroughs such as Google DeepMind's AlphaGo and agents trained in environments like Dota 2. Spinning Up includes a collection of important reinforcement learning research papers, a glossary of terminology necessary to understand RL, and a collection of algorithms for running exercises. The program is being launched not just to help people learn how reinforcement learning works, but to make progress towards OpenAI's general goal of safely creating artificial general intelligence (AGI) by involving more people from fields beyond computer science. "Solving AI safety will require people with a wide range of expertise and perspectives, and many relevant professions have no connection to engineering or computer science at all.

large language model, machine learning, reinforcement, (14 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Bias and Generalization in Deep Generative Models: An Empirical Study

Zhao, Shengjia, Ren, Hongyu, Yuan, Arianna, Song, Jiaming, Goodman, Noah, Ermon, Stefano

arXiv.org Machine LearningNov-7-2018

In high dimensional settings, density estimation algorithms rely crucially on their inductive bias. Despite recent empirical success, the inductive bias of deep generative models is not well understood. In this paper we propose a framework to systematically investigate bias and generalization in deep generative models of images. Inspired by experimental methods from cognitive psychology, we probe each learning algorithm with carefully designed training datasets to characterize when and how existing models generate novel attributes and their combinations. We identify similarities to human psychology and verify that these patterns are consistent across commonly used models and architectures.

artificial intelligence, machine learning, proportion, (17 more...)

arXiv.org Machine Learning

1811.03259

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.81)

Add feedback