Reviews: Learning Hawkes Processes from a handful of events

Neural Information Processing Systems 

I have read the rebuttal and other reviewer's comments. Please elaborate them in more details as needed in the revision. The scarcity of the data is caused by the naturally smaller intensity function. If one dimension has a harder to learn triggering function, it may propagate onto other dimensions. In prior works, such cases are hard to handle, and I think the method proposed here could be used as a remedy to this issue.