DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection