6fb52e71b837628ac16539c1ff911667-AuthorFeedback.pdf

Neural Information Processing Systems 

Specifically, we use F1 score with average='weighted' from sklearn, this averages F1 score from each class