Goto

Collaborating Authors

 Reinforcement Learning





TowardsPlayingFullMOBAGameswith DeepReinforcementLearning

Neural Information Processing Systems

As aresult, full MOBAgames without restrictions are farfrom being mastered by any existing AI system. In this paper, we propose a MOBA AIlearning paradigm that methodologically enables playing full MOBAgames withdeepreinforcementlearning.Specifically,wedevelopacombinationofnovel and existing learning techniques, including curriculum self-play learning, policy distillation, off-policy adaption, multi-head value estimation, and Monte-Carlo tree-search, intraining andplaying alargepoolofheroes,meanwhile addressing thescalabilityissueskillfully.





UnderstandingEnd-to-EndModel-Based ReinforcementLearningMethodsasImplicit Parameterization

Neural Information Processing Systems

While knowntobesample efficient, these methods havefailed tofully leverage recent advances indeep learning, forcing the use of less efficient but more scalable model-free methods which try to learn the values directly.


UnderstandingEnd-to-EndModel-Based ReinforcementLearningMethodsasImplicit Parameterization

Neural Information Processing Systems

While knowntobesample efficient, these methods havefailed tofully leverage recent advances indeep learning, forcing the use of less efficient but more scalable model-free methods which try to learn the values directly.