MAGNET: AMulti-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks

Open in new window