An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment Hugo Malard Michel Olvera 1 Stéphane Lathuiliere 1
–Neural Information Processing Systems
Neural Information Processing Systems
May-29-2025, 08:17:05 GMT
- Country:
- Europe > Netherlands (0.14)
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (1.00)
- Natural Language > Large Language Model (1.00)
- Vision (1.00)
- Information Technology > Artificial Intelligence