Dynamic Multi-Target Fusion for Efficient Audio-Visual Navigation