Compression-aware Training of Deep Networks