Goto

Collaborating Authors

 Reinforcement Learning


benchmarks (Freeman et al., 2021) show that T A

Neural Information Processing Systems

However, various systems are inherently continuous in time, making discrete-time MDPs an inexact modeling choice. In many applications, such as greenhouse control or medical treatments, each interaction (measurement or switching of action) involves manual intervention and thus is inherently costly.