Deep Reinforcement Learning of Marked Temporal Point Processes

Utkarsh Upadhyay, Abir De, Manuel Gomez Rodriguez

Neural Information Processing Systems

In doing so, we define the agent's policy using the intensity and