An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation