Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle

Simon S. Du, Yuping Luo, Ruosong Wang, Hanrui Zhang

Neural Information Processing Systems http://nips.cc/
