Goto

Collaborating Authors

 continuous time and discrete space


Review for NeurIPS paper: POMDPs in Continuous Time and Discrete Spaces

Neural Information Processing Systems

Summary and Contributions: Post-rebuttal: I would like to thank the authors for the thoughtful response. The main issue for me was clarity, and I'm happy that the authors agreed to improve this aspect of the paper. However, it's hard to increase my score based on this promise alone. Nevertheless, my recommendation should really be considered a borderline recommendation. I will not fight against accepting this paper. This involves both filtering and control.


Review for NeurIPS paper: POMDPs in Continuous Time and Discrete Spaces

Neural Information Processing Systems

The paper describes new offline and online techniques to optimize the policy of continuous time discrete state and action POMDPs. This paper makes an important contribution to the RL and control literature. Very little work has focused on continuous time control problems in the ML community. While the techniques assume that the model is known, do not scale to high dimensional problems and were tested only on toy problems, they introduce new formalisms that will help the community get familiar with the mathematics of continuous time control. Hence this paper will be of high interest for the RL community.


POMDPs in Continuous Time and Discrete Spaces

Neural Information Processing Systems

Many processes, such as discrete event systems in engineering or population dynamics in biology, evolve in discrete space and continuous time. We consider the problem of optimal decision making in such discrete state and action space systems under partial observability. This places our work at the intersection of optimal filtering and optimal control. At the current state of research, a mathematical description for simultaneous decision making and filtering in continuous time with finite state and action spaces is still missing. In this paper, we give a mathematical description of a continuous-time partial observable Markov decision process (POMDP).