Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction

Neural Information Processing Systems 

One of the primary drivers of the success of machine learning methods in open-world perception settings, such as computer vision [19] and NLP [8], has been the ability of high-capacity function approximators, such as deep neural networks, to learn generalizable models from large amounts of data.
