Huge AI models can be halved in size without degrading performance