Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention

Open in new window