Understanding When and Why Graph Attention Mechanisms Work via Node Classification