Adaptive gradient descent without descent