Mismatched No More: Joint Model-Policy Optimization for Model-Based RL

Neural Information Processing Systems 

A version of this bound becomes tight under certain assumptions.