Improving Graph Attention Networks with Large Margin-based Constraints