Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser
–Neural Information Processing Systems
Audio-visual learning has been a major pillar of multi-modal machine learning, where the community mostly focused on its modality-aligned setting, i.e ., the
Neural Information Processing Systems
Feb-17-2026, 18:02:22 GMT
- Country:
- Asia
- Japan > Honshū
- Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- South Korea > Gyeonggi-do
- Suwon (0.04)
- Taiwan (0.04)
- Japan > Honshū
- Asia
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Information Technology > Security & Privacy (0.93)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks (1.00)
- Natural Language (0.94)
- Vision (1.00)
- Security & Privacy (0.93)
- Artificial Intelligence
- Information Technology