Off-policy estimation with adaptively collected data: the power of online learning

Neural Information Processing Systems 

Off-policy estimation: One often needs to estimate a linear functional from observational data collected under a behavior policy.
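To make the setting concrete, here is a minimal sketch of one standard off-policy estimator, inverse propensity weighting (IPW), applied to a simulated two-action bandit. It is not the method of the paper. It assumes a fixed behavior policy with known action probabilities, whereas the title concerns adaptively collected data. The names `ipw_value` and `simulate_logs`, the policies, and the reward means are all hypothetical and chosen only for illustration.

```python
import random


def ipw_value(logs, target_probs):
    """Inverse-propensity-weighted estimate of the target policy's mean reward.

    logs: list of (action, reward, behavior_prob) tuples, where behavior_prob
          is the probability with which the behavior policy chose that action.
    target_probs: dict mapping action -> probability under the target policy.
    """
    total = 0.0
    for action, reward, behavior_prob in logs:
        # Reweight each logged reward by the ratio of target to behavior probability.
        total += target_probs[action] / behavior_prob * reward
    return total / len(logs)


def simulate_logs(n, behavior_probs, mean_reward, seed=0):
    """Draw n logged interactions with Bernoulli rewards (illustrative data only)."""
    rng = random.Random(seed)
    actions = list(behavior_probs)
    weights = [behavior_probs[a] for a in actions]
    logs = []
    for _ in range(n):
        a = rng.choices(actions, weights=weights)[0]
        r = 1.0 if rng.random() < mean_reward[a] else 0.0
        logs.append((a, r, behavior_probs[a]))
    return logs


# Hypothetical behavior policy, target policy, and per-action reward means.
behavior = {0: 0.8, 1: 0.2}
target = {0: 0.1, 1: 0.9}
mean_reward = {0: 0.3, 1: 0.7}

logs = simulate_logs(20000, behavior, mean_reward)
estimate = ipw_value(logs, target)

# Ground truth is available here only because the data are simulated.
true_value = sum(target[a] * mean_reward[a] for a in target)
print(f"IPW estimate: {estimate:.3f}, true value: {true_value:.3f}")
```

The estimand here, the target policy's expected reward, is linear in the mean-reward function, which is the sense in which it is a linear functional. The estimator is unbiased when the logged propensities are correct and every action the target policy can take has positive behavior probability.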
