Self-supervised Audio Spatialization with Correspondence Classifier