Multi-Modal Scene Graph with Kolmogorov-Arnold Experts for Audio-Visual Question Answering
