SPACE: Noise Contrastive Estimation Stabilizes Self-Play Fine-Tuning for Large Language Models