Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces
Motoya Ohnishi, Masahiro Yukawa, Mikael Johansson, Masashi Sugiyama
–Neural Information Processing Systems
Motivated by the success of reinforcement learning (RL) for discrete-time tasks such as AlphaGo and Atari games, there has been a recent surge of interest in using RL for continuous-time control of physical systems (cf.
Neural Information Processing Systems
Oct-7-2024, 13:32:24 GMT
- Country:
- North America (0.28)
- Genre:
- Overview (0.46)
- Industry:
- Leisure & Entertainment > Games > Computer Games (0.54)
- Technology: