TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining

Open in new window