Deep Reinforcement Learning of Marked Temporal Point Processes