Statistical guarantees for continuous-time policy evaluation: blessing of ellipticity and new tradeoffs

Open in new window