AITopics | Generative AI

Collaborating Authors

Generative AI

News Overviews Instructional Materials AI-Alerts Classics

AI Can Do Great Things--if It Doesn't Burn the Planet

#artificialintelligenceJan-22-2020, 09:48:41 GMT

Last month, researchers at OpenAI in San Francisco revealed an algorithm capable of learning, through trial and error, how to manipulate the pieces of a Rubik's Cube using a robotic hand. It was a remarkable research feat, but it required more than 1,000 desktop computers plus a dozen machines running specialized graphics chips crunching intensive calculations for several months. The effort may have consumed about 2.8 gigawatt-hours of electricity, estimates Evan Sparks, CEO of Determined AI, a startup that provides software to help companies manage AI projects. A spokesperson for OpenAI questioned the calculation, noting that it makes several assumptions. But OpenAI declined to disclose further details of the project or offer an estimate of the electricity it consumed.

algorithm, electricity, great thing, (10 more...)

#artificialintelligence

AI-Alerts: 2020 > 2020-01 > AAAI AI-Alert for Jan 28, 2020 (1.00)

Country:

North America > United States > California > San Francisco County > San Francisco (0.25)
North America > Canada (0.05)

Industry:

Leisure & Entertainment > Games (0.36)
Energy > Power Industry (0.32)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.81)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.70)

Add feedback

Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative Models

Shi, Yuge, N, Siddharth, Paige, Brooks, Torr, Philip

Neural Information Processing SystemsJan-12-2020, 05:36:54 GMT

Learning generative models that span multiple data modalities, such as vision and language, is often motivated by the desire to learn more useful, generalisable representations that faithfully capture common underlying factors between the modalities. In this work, we characterise successful learning of such models as the fulfilment of four criteria: i) implicit latent decomposition into shared and private subspaces, ii) coherent joint generation over all modalities, iii) coherent cross-generation across individual modalities, and iv) improved model learning for individual modalities through multi-modal integration. Here, we propose a mixture-of-experts multi-modal variational autoencoder (MMVAE) for learning of generative models on different sets of modalities, including a challenging image - language dataset, and demonstrate its ability to satisfy all four criteria, both qualitatively and quantitatively. Papers published at the Neural Information Processing Systems Conference.

modality, multi-modal deep generative model, variational mixture-of-expert autoencoder, (3 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback

Reward Engineering for Object Pick and Place Training

Nagpal, Raghav, Krishnan, Achyuthan Unni, Yu, Hanshen

arXiv.org Artificial IntelligenceJan-11-2020

Robotic grasping is a crucial area of research as it can result in the acceleration of the automation of several Industries utilizing robots ranging from manufacturing to healthcare. Reinforcement learning is the field of study where an agent learns a policy to execute an action by exploring and exploiting rewards from an environment. Reinforcement learning can thus be used by the agent to learn how to execute a certain task, in our case grasping an object. We have used the Pick and Place environment provided by OpenAI's Gym to engineer rewards. Hindsight Experience Replay (HER) has shown promising results with problems having a sparse reward. In the default configuration of the OpenAI baseline and environment the reward function is calculated using the distance between the target location and the robot end-effector. By weighting the cost based on the distance of the end-effector from the goal in the x,y and z-axes we were able to almost halve the learning time compared to the baselines provided by OpenAI, an intuitive strategy that further reduced learning time. In this project, we were also able to introduce certain user desired trajectories in the learnt policies (city-block / Manhattan trajectories). This helps us understand that by engineering the rewards we can tune the agent to learn policies in a certain way even if it might not be the most optimal but is the desired manner.

algorithm, reward function, trajectory, (15 more...)

arXiv.org Artificial Intelligence

2001.03792

Country:

North America > United States > Massachusetts > Worcester County > Worcester (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.90)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)
(2 more...)

Add feedback

New Projects See GPT-2 Summarizing Movies, Playing Chess

#artificialintelligenceJan-8-2020, 22:04:41 GMT

New Netflix and Chess applications have once again illustrated the range and potential of OpenAI's GPT-2 language model. GPT-2 became an overnight sensation when OpenAI released it in February 2019, receiving critical acclaim worldwide. GPT-2 is a successor to GPT, a large transformer-based language model with 1.5 million parameters and trained on 8 million web pages. GPT-2 can automatically generate coherent paragraphs of text, basically predicting the next word given previous words. One of the big challenges for language models is domain-specific datasets that involve a lot of expert knowledge. That's GPT-2's charm point -- it seems free of such constraints, outperforming other language models on specific domains like Wikipedia, news, or books without using any domain-specific training datasets.

dataset, gpt-2, language model, (4 more...)

#artificialintelligence

Industry:

Leisure & Entertainment > Games > Chess (1.00)
Media > Television (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.53)

Add feedback

Granular Learning with Deep Generative Models using Highly Contaminated Data

Just, John

arXiv.org Machine LearningJan-6-2020

An approach to utilize recent advances in deep generative models for anomaly detection in a granular (continuous) sense on a real-world image dataset with quality issues is detailed using recent normalizing flow models, with implications in many other applications/domains/data types. The approach is completely unsupervised (no annotations available) but qualitatively shown to provide accurate semantic labeling for images via heatmaps of the scaled log-likelihood overlaid on the images. When sorted based on the median values per image, clear trends in quality are observed. Furthermore, downstream classification is shown to be possible and effective via a weakly supervised approach using the log-likelihood output from a normalizing flow model as a training signal for a feature-extracting convolutional neural network. The pre-linear dense layer outputs on the CNN are shown to disentangle high level representations and efficiently cluster various quality issues. Thus, an entirely non-annotated (fully unsupervised) approach is shown possible for accurate estimation and classification of quality issues..

architecture, international conference, just & ghosal, (12 more...)

arXiv.org Machine Learning

2001.04297

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
(2 more...)

Genre: Research Report (0.40)

Industry: Health & Medicine (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.62)

Add feedback

Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative Models

Shi, Yuge, N, Siddharth, Paige, Brooks, Torr, Philip

Neural Information Processing SystemsDec-31-2019

Learning generative models that span multiple data modalities, such as vision and language, is often motivated by the desire to learn more useful, generalisable representations that faithfully capture common underlying factors between the modalities. In this work, we characterise successful learning of such models as the fulfilment of four criteria: i) implicit latent decomposition into shared and private subspaces, ii) coherent joint generation over all modalities, iii) coherent cross-generation across individual modalities, and iv) improved model learning for individual modalities through multi-modal integration. Here, we propose a mixture-of-experts multimodal variational autoencoder (MMVAE) to learn generative models on different sets of modalities, including a challenging image language dataset, and demonstrate its ability to satisfy all four criteria, both qualitatively and quantitatively. Code, data, and models are provided at this url.

generative model, international conference, modality, (15 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Asia > Middle East > Jordan (0.05)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback

This Browser Extension 'GPTrue or False' Can Identify AI Written Content MarkTechPost

#artificialintelligenceDec-25-2019, 21:20:00 GMT

Recently OpenAI announced the launch of its 1.5 billion parameter language model GPT-2. GPT-2 has been in the news as the scary AI text generator with potential threats regarding fake news stories, and so on. But now we have'GPTrue or False' browser extension that displays the GPT-2 Log Probability of selected portions of text. This browser extension allows you to select text on a website and finds out what you selected is written using OpenAI's GPT-2 A.I. model. GPTrue or False is available both for Chrome and Firefox.

browser extension, content marktechpost, extension, (3 more...)

#artificialintelligence

Industry: Media > News (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.60)

Add feedback

OpenAI Benchmarks Reinforcement Learning To Avoid Model Overfitting

#artificialintelligenceDec-25-2019, 08:10:29 GMT

OpenAI has benchmarked reinforcement learning by mitigating most of its problems using the procedural generational technique. RL has been a central methodology in the field of artificial intelligence. However, over the years, researchers have witnessed a few shortcomings with the approach. Developers often use a colossal amount of data to train and increase the efficiency of machine learning models. But this has resulted in overfitting of data in many cases, thereby, causing hindrance in the adoption of ML technologies.

large language model, machine learning, reinforcement learning, (17 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.78)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.71)

Add feedback

OpenAI Open Sources Safety Gym to Improve Safety in Reinforcement Learning Agents

#artificialintelligenceDec-22-2019, 12:27:33 GMT

Safety is one of the emerging concerns in deep learning systems. In the context of deep learning systems, safety is related to building agents that respect safety dynamics in a given environment. In many cases such as supervised learning, safety is modeled as part of the training datasets. However, other methods such as reinforcement learning require agents to master the dynamics of the environments by experimenting with it which introduces its own set of safety concerns. To address some of these challenges, OpenAI has recently open sourced Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safety constraints while training.

agent, reinforcement, safety gym, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.64)

Add feedback

Roberto G.E. Martín on LinkedIn: #AI #RL

#artificialintelligenceDec-15-2019, 09:41:50 GMT

On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game. The game of Dota 2 presents novel challenges for AI systems such as long-time horizons, imperfect information, and complex, continuous state-action spaces, all challenges which will become increasingly central to more capable AI systems.

ai system, dota 2, linkedin

#artificialintelligence

Technology:

Information Technology > Communications > Social Media (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.41)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.38)
(2 more...)

Add feedback