Robust Visual Reasoning via Language Guided Neural Module Networks
–Neural Information Processing Systems
Neural module networks (NMN) are a popular approach for solving multi-modal tasks such as visual question answering (VQA) and visual referring expression recognition (REF). A key limitation in prior implementations of NMN is that the neural modules do not effectively capture the association between the visual input and the relevant neighbourhood context of the textual input.
Neural Information Processing Systems
Dec-24-2025, 04:18:43 GMT
- Technology:
- Information Technology > Artificial Intelligence > Vision (0.59)