Improved Convergence Rates for Non-Convex Federated Learning with Compression