Adai: Separating the Effects of Adaptive Learning Rate and Momentum Inertia