Multi-View Self-Attention Based Transformer for Speaker Recognition