Neural Information Processing Systems 

We respond to the major comments as follows. Take RL with the linear model as an example. More formally, we believe that the key is to prove an analogue of Lemma 5 for the linear model. We will also discuss the work on policy certificates (Dann et al., 2019) in the related work section. We will add this discussion to the next version of the paper.