MAC: An Efficient Gradient Preconditioning using Mean Activation Approximated Curvature