isneardoor
Simultaneous Abstract and Concrete Reinforcement Learning
Matos, Tiago (Universidade de Sao Paulo) | Bergamo, Yannick P. (Universidade de Sao Paulo) | Silva, Valdinei Freire da (Universidade de Sao Paulo) | Cozman, Fabio G. (Universidade de Sao Paulo) | Costa, Anna Helena Reali (Universidade de Sao Paulo)
Suppose an agent builds a policy that satisfactorily solves a decision problem; suppose further that some aspects of this policy are abstracted and used as starting point in a new, different decision problem. How can the agent accrue the benefits of the abstract policy in the new concrete problem? In this paper we propose a framework for simultaneous reinforcement learning, where the abstract policy helps start up the policy for the concrete problem, and both policies are refined through exploration. We report experiments that demonstrate that our framework is effective in speeding up policy construction for practical problems.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- South America > Brazil > São Paulo (0.04)
- North America > United States > New York (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)