The Promise of Hierarchical Reinforcement Learning
"This top-down planning approach decides what a good subgoal is before planning to achieve it." "For complex, high-dimensional Markov Decision Processes (MDPs), it may be necessary to represent the policy with function approximation. A problem is misspecified whenever the representation cannot express any policy with acceptable performance."
Mar-9-2019, 23:58:49 GMT
- Country:
- Europe > United Kingdom > England (0.28)
- Genre:
- Research Report (0.68)
- Overview (0.46)
- Industry:
- Education (0.94)
- Leisure & Entertainment > Games
- Computer Games (0.47)