Goto

Collaborating Authors

 Reinforcement Learning




Trust Region-Guided Proximal Policy Optimization

Neural Information Processing Systems

Deep model-free reinforcement learning has achieved great successes in recent years, notably in video games [11], board games [19], robotics [10], and challenging control tasks [17,5].



Better Exploration with Optimistic Actor Critic

Neural Information Processing Systems

Actor-critic methods, a type of model-free Reinforcement Learning, have beensuccessfully applied to challenging tasks in continuous control, often achievingstate-of-the artperformance.





Learning Latent Process from High-Dimensional Event Sequences via Efficient Sampling

Neural Information Processing Systems

There are plenty of previous studies targeting the problem from different aspects. For temporal point process, agreat number of works [3, 13, 15, 16, 28] attempt to model the intensify function from statistic views, and recent studies harness deep recurrent model [24], generative adversarial network [23] and reinforcement learning [19, 18] to learn the temporal process. These researches mainly focus on one-dimension eventsequences where eacheventpossesses thesame marker.