Enhancing Self-Supervised Learning with Semantic Pairs: A New Dataset and Empirical Study