SOPE: Spectrum of Off-Policy Estimators

Neural Information Processing Systems 

Consequently, if the parameterization is not rich enough, then it may not be possible to represent the distribution ratios accurately, and when using rich function approximators (such as neural networks) then the optimization procedure may get stuck in sub-optimal saddle points.