Maintaining the Illusion of Reality: Transfer in RL by Keeping Agents in the DARC
Reinforcement learning (RL) is often touted as a promising approach for costly and risk-sensitive applications, yet practicing and learning directly in those domains is expensive. It costs time (e.g., OpenAI's Dota 2 project used 10,000 years of experience), it costs money (e.g., "inexpensive" robotic arms used in research typically cost $10,000 to $30,000), and it can even be dangerous to humans. How can an intelligent agent learn to solve tasks in environments in which it cannot practice? For many tasks, such as assistive robotics and self-driving cars, we may have access to a different practice area, which we will call the source domain. Although the source domain has different dynamics from the target domain (the environment in which the agent must ultimately perform), experience in the source domain is much cheaper to collect.
Jul-31-2020, 19:10:33 GMT