Automatic Channel Pruning for Multi-Head Attention