Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding
