DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?