Model-Free Reinforcement Learning with the Decision-Estimation Coefficient Dylan J. Foster

Neural Information Processing Systems 

In what follows, we review the Decision-Estimation Coefficient and Estimation-to-Decisions meta-algorithm (Section 1.2), highlighting opportunities for improvement.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found