A High Probability Analysis of Adaptive SGD with Momentum