Communication Compression for Distributed Learning without Control Variates