The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation