Polarity-Aware Probing for Quantifying Latent Alignment in Language Models