AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation