AITopics | Generative AI

OpenAI

#artificialintelligence | Jun-14-2017, 17:35:39 GMT

deep learning, large language model, openai, (4 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Learning from Human Preferences

#artificialintelligence | Jun-14-2017, 15:50:21 GMT

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind's safety team, we've developed an algorithm which can infer what humans want by being told which of two proposed behaviors is better. We present a learning algorithm that uses small amounts of human feedback to solve modern RL environments. Machine learning systems with human feedback have been explored before, but we've scaled up the approach to be able to work on much more complicated tasks. Our algorithm needed 900 bits of feedback from a human evaluator to learn to backflip -- a task that is simple to judge but challenging to specify.
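
The idea of inferring a goal from "which of two behaviors is better" can be illustrated with a small sketch. This is not the algorithm from the post: it assumes a hypothetical linear reward over hand-made segment features and a Bradley-Terry style preference model, and the names `segment_reward`, `fit_reward`, and `demo` are made up for illustration.

```python
import math
import random


def preference_probability(reward_a, reward_b):
    """Modelled probability that behavior A is preferred over behavior B."""
    return 1.0 / (1.0 + math.exp(reward_b - reward_a))


def segment_reward(weights, features):
    """Hypothetical linear reward: dot product of weights and features."""
    return sum(w * f for w, f in zip(weights, features))


def fit_reward(comparisons, n_features, lr=0.1, epochs=200):
    """Fit reward weights from (features_a, features_b, a_preferred) triples
    by gradient ascent on the log-likelihood of the recorded preferences."""
    weights = [0.0] * n_features
    for _ in range(epochs):
        for feats_a, feats_b, a_preferred in comparisons:
            p = preference_probability(segment_reward(weights, feats_a),
                                       segment_reward(weights, feats_b))
            error = (1.0 if a_preferred else 0.0) - p
            for i in range(n_features):
                weights[i] += lr * error * (feats_a[i] - feats_b[i])
    return weights


def demo(seed=0, n_pairs=200):
    """Simulate a 'human' with hidden preferences and recover their direction."""
    rng = random.Random(seed)
    true_weights = [2.0, -1.0]  # assumption: stands in for the human judge
    comparisons = []
    for _ in range(n_pairs):
        feats_a = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        feats_b = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        a_preferred = (segment_reward(true_weights, feats_a)
                       > segment_reward(true_weights, feats_b))
        comparisons.append((feats_a, feats_b, a_preferred))
    learned = fit_reward(comparisons, n_features=2)
    agreement = sum(
        (segment_reward(learned, a) > segment_reward(learned, b)) == pref
        for a, b, pref in comparisons
    ) / n_pairs
    return learned, agreement
```

Calling `demo()` returns the learned weights and the fraction of comparisons the learned reward ranks the same way as the simulated judge. In a full system, a reward learned this way would then be handed to an RL agent in place of a hand-written goal function.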

human feedback, large language model, machine learning, (20 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.87)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.41)

OpenAI, DeepMind double team to make future AI machines safer

#artificialintelligence | Jun-14-2017, 10:50:07 GMT

Researchers from OpenAI and DeepMind are hoping to make artificial intelligence safer using a new algorithm that learns from human feedback. Both companies are experts in reinforcement learning – an area of machine learning that rewards agents if they take the right actions to complete a task in a given environment. The goal is specified through an algorithm, and the agent is programmed to chase the reward, like winning points in a game. Reinforcement learning has been successful in teaching machines how to play games like Doom or Pong, or to drive autonomous cars in simulation. It's a powerful method to explore an agent's behavior, but it can be dangerous if the hard-coded algorithm is wrong or produces undesirable effects.

large language model, machine learning, natural language, (14 more...)

#artificialintelligence

Genre: Research Report (0.77)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.66)

Learning to Cooperate, Compete, and Communicate

#artificialintelligence | Jun-10-2017, 15:40:13 GMT

Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties: first, there is a natural curriculum -- the difficulty of the environment is determined by the skill of your competitors (and if you're competing against clones of yourself, the environment exactly matches your skill level). Second, a multiagent environment has no stable equilibrium: no matter how smart an agent is, there's always pressure to get smarter. These environments have a very different feel from traditional environments, and it'll take a lot more research before we become good at them. We've developed a new algorithm, MADDPG, for centralized learning and decentralized execution in multiagent environments, allowing agents to learn to collaborate and compete with each other.
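
The phrase "centralized learning and decentralized execution" describes who sees what. The sketch below shows only that information flow, not MADDPG itself: the linear `LinearActor` and `CentralCritic` classes, their weights, and the example observations are all hypothetical stand-ins for the neural networks an actual implementation would use.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class LinearActor:
    """Decentralized policy: acts on this agent's own observation only."""
    weights: List[float]

    def act(self, own_observation: Sequence[float]) -> float:
        return sum(w * o for w, o in zip(self.weights, own_observation))


@dataclass
class CentralCritic:
    """Centralized critic: scores the joint observations and joint actions.
    It is needed during training only, not when the agents are deployed."""
    weights: List[float]

    def q_value(self, all_observations, all_actions) -> float:
        joint = [x for obs in all_observations for x in obs] + list(all_actions)
        return sum(w * x for w, x in zip(self.weights, joint))


def joint_step(actors, critic, all_observations):
    """Each actor picks an action from its own observation; the critic
    then evaluates everything at once."""
    actions = [actor.act(obs) for actor, obs in zip(actors, all_observations)]
    return actions, critic.q_value(all_observations, actions)


# Two agents with two-dimensional observations (illustrative values).
actors = [LinearActor([1.0, -1.0]), LinearActor([0.5, 0.5])]
critic = CentralCritic([0.1] * 6)  # 2 agents * 2 obs dims + 2 actions
observations = [[1.0, 2.0], [3.0, 4.0]]
actions, q = joint_step(actors, critic, observations)
```

Changing the second agent's observation leaves the first agent's action untouched but changes the critic's value, which is the asymmetry the summary describes.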

artificial intelligence, deep learning, machine learning, (15 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.41)

OpenAI's new approach for one-shot imitation learning, a peek into the future of AI

#artificialintelligence | Jun-7-2017, 22:10:21 GMT

On May 16, OpenAI researchers shared a video of one of their projects along with two important papers exploring solutions to three key bottlenecks of current AI development: meta-learning, one-shot learning, and automated data generation. In my previous post, I promised an article dedicated to the fascinating problem of one-shot learning, so here goes. In this video you see a one-arm physical robot stacking cubes on top of each other. Given the complex tasks that industrial robots can already perform, the video would in many respects be underwhelming if the researcher were not explaining what is going on. In a controlled environment the task is simple, and procedural (hard-coded) approaches have already solved this problem. What is promising and revolutionary is how far the general framework underneath could scale to multiple, more complex and adaptive behaviors in noisier environments.

artificial intelligence, deep learning, machine learning, (8 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.63)

Elon Musk's OpenAI breaks new ground in AI research - IoT Agenda

#artificialintelligence | Jun-6-2017, 01:40:21 GMT

At the core of the AI system are two different neural networks -- a vision network and an imitation network. These two work behind the scenes to provide the remarkable capability to imitate human actions, a giant step closer to building true AI systems. A robotic arm repeats the process of picking up blocks and stacking them in a particular configuration. It does this by witnessing just once a simulated demonstration performed by a human using a VR headset. Researchers have used thousands of simulated images to train the vision network.

artificial intelligence, machine learning, natural language, (10 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Robots (0.80)
Information Technology > Artificial Intelligence > Natural Language (0.76)
Information Technology > Human Computer Interaction > Interfaces (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Elon Musk's OpenAI breaks new ground in AI research

#artificialintelligence | Jun-6-2017, 01:40:05 GMT

Elon Musk keeps surprising the world with his technological breakthroughs. OpenAI, a non-profit company focused on AI research, recently made an announcement regarding its groundbreaking AI invention. It has developed an AI system that can complete an actual physical task after watching just one demonstration of the task. At the core of the AI system are two different neural networks -- a vision network and an imitation network. These two work behind the scenes to provide the remarkable capability to imitate human actions, a giant step closer to building true AI systems.

large language model, machine learning, natural language, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.69)

[R] [1706.00550] On Unifying Deep Generative Models • r/MachineLearning

@machinelearnbot | Jun-5-2017, 15:25:08 GMT

Deep generative models have achieved impressive success in recent years. Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs), as powerful frameworks for deep generative model learning, have largely been considered two distinct paradigms and have each received extensive independent study. This paper establishes formal connections between deep generative modeling approaches through a new formulation of GANs and VAEs. We show that GANs and VAEs are essentially minimizing KL divergences with opposite directions and reversed latent/visible treatments, extending the two learning phases of the classic wake-sleep algorithm, respectively. The unified view provides a powerful tool to analyze a diverse set of existing model variants, and enables the exchange of ideas across research lines in a principled way.
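
The remark about KL divergences "with opposite directions" rests on the fact that KL divergence is not symmetric. The sketch below shows only that asymmetry on two made-up discrete distributions; it does not reproduce the paper's formulation of GANs and VAEs, and the values of `p` and `q` are arbitrary.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) for discrete distributions given as equal-length lists.
    Terms with p_i == 0 contribute nothing; q_i must be > 0 where p_i > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


p = [0.5, 0.5]  # illustrative distribution
q = [0.9, 0.1]  # illustrative distribution

forward = kl_divergence(p, q)  # expectation taken under p
reverse = kl_divergence(q, p)  # expectation taken under q
```

Here `forward` and `reverse` come out different, so minimizing one direction penalizes different mismatches than minimizing the other.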

deep learning, machine learning, machinelearning, (2 more...)

@machinelearnbot

Industry: Media > News (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.93)

Robots that Learn

#artificialintelligence | May-27-2017, 21:46:13 GMT

Last month, we showed an earlier version of this robot where we'd trained its vision system using domain randomization, that is, by showing it simulated objects with a variety of colors, backgrounds, and textures, without the use of any real images. Now, we've developed and deployed a new algorithm, one-shot imitation learning, allowing a human to communicate how to do a new task by performing it in VR. Given a single demonstration, the robot is able to solve the same task from an arbitrary starting configuration. Caption: Our system can learn a behavior from a single demonstration delivered within a simulator, then reproduce that behavior in different setups in reality. The system is powered by two neural networks: a vision network and an imitation network. The vision network ingests an image from the robot's camera and outputs a state representing the positions of the objects.
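
Domain randomization, as described above, amounts to drawing the visual appearance of each simulated training scene at random. The following is a minimal sketch under stated assumptions: the `SceneAppearance` fields, the `BACKGROUNDS` and `TEXTURES` lists, and the function names are invented for illustration, and a real pipeline would pass these parameters to a renderer rather than just collect them.

```python
import random
from dataclasses import dataclass
from typing import Tuple

# Hypothetical appearance options; a real simulator would have its own assets.
BACKGROUNDS = ("table_wood", "table_metal", "checkerboard", "plain_grey")
TEXTURES = ("matte", "glossy", "striped", "noisy")


@dataclass(frozen=True)
class SceneAppearance:
    object_rgb: Tuple[float, float, float]  # each channel in [0, 1)
    background: str
    texture: str


def sample_appearance(rng: random.Random) -> SceneAppearance:
    """Draw one random combination of color, background, and texture."""
    return SceneAppearance(
        object_rgb=(rng.random(), rng.random(), rng.random()),
        background=rng.choice(BACKGROUNDS),
        texture=rng.choice(TEXTURES),
    )


def randomized_training_scenes(n: int, seed: int = 0):
    """Generate n randomized appearances for simulated training images."""
    rng = random.Random(seed)
    return [sample_appearance(rng) for _ in range(n)]


scenes = randomized_training_scenes(5)
```

The intuition is that a vision network trained across enough of this variation may come to treat the real world as one more variation, which is how training without real images can work.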

artificial intelligence, demonstration, machine learning, (18 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.42)

[P] OpenAI Baselines: DQN • r/MachineLearning

@machinelearnbot | May-24-2017, 17:40:16 GMT

This is probably sarcasm, but the point is to get baselines for algorithms which nowadays have lots of tiny tricks which aren't reported in the paper. I had to go through spragnur's DQN code to get those tricks and it took a LONG time...

large language model, machine learning, natural language, (6 more...)

@machinelearnbot

Industry: Media > News (0.40)

Technology: