Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning

Oct-10-2024, 17:50:25 GMT–Neural Information Processing Systems

Without relevant human priors, neural networks may learn uninterpretable features. We propose Dynamics of Attention for Focus Transition (DAFT) as a human prior for machine reasoning. DAFT is a novel method that regularizes attention-based reasoning by modelling it as a continuous dynamical system using neural ordinary differential equations. As a proof of concept, we augment a state-of-the-art visual reasoning model with DAFT. Our experiments reveal that applying DAFT yields similar performance to the original model while using fewer reasoning steps, showing that it implicitly learns to skip unnecessary steps.
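As a rough illustration of what modelling attention as a continuous dynamical system can look like, the sketch below evolves a vector of attention logits under an ordinary differential equation and reads out an attention distribution at each point in time. It is not the authors' implementation. The dynamics function (a fixed random `tanh` layer standing in for a learned network), the fixed-step Runge-Kutta integrator, and every name and size are assumptions made for illustration only.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def make_dynamics(n, seed=0):
    """Hypothetical stand-in for a learned network: dz/dt = tanh(W z + b)."""
    rng = random.Random(seed)
    W = [[rng.gauss(0.0, 0.5) for _ in range(n)] for _ in range(n)]
    b = [rng.gauss(0.0, 0.1) for _ in range(n)]

    def f(z, t):
        # t is unused here; a learned dynamics function could depend on it.
        return [
            math.tanh(sum(W[i][j] * z[j] for j in range(n)) + b[i])
            for i in range(n)
        ]

    return f


def rk4_step(f, z, t, h):
    """One classical fourth-order Runge-Kutta step of size h."""
    k1 = f(z, t)
    k2 = f([zi + 0.5 * h * k for zi, k in zip(z, k1)], t + 0.5 * h)
    k3 = f([zi + 0.5 * h * k for zi, k in zip(z, k2)], t + 0.5 * h)
    k4 = f([zi + h * k for zi, k in zip(z, k3)], t + h)
    return [
        zi + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
        for zi, a, b, c, d in zip(z, k1, k2, k3, k4)
    ]


def attention_trajectory(f, z0, t_end=1.0, steps=10):
    """Integrate the logits from t=0 to t_end; return attention at each step."""
    h = t_end / steps
    z, t = list(z0), 0.0
    trajectory = [softmax(z)]
    for _ in range(steps):
        z = rk4_step(f, z, t, h)
        t += h
        trajectory.append(softmax(z))
    return trajectory


# Start from uniform attention over four hypothetical items.
dynamics = make_dynamics(4)
trajectory = attention_trajectory(dynamics, [0.0, 0.0, 0.0, 0.0])
```

Each entry of `trajectory` is a probability distribution over the four items, so the focus of attention changes smoothly over time rather than jumping between discrete reasoning steps. A practical system would replace the fixed random layer with a trained network and would typically use an adaptive ODE solver.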

interpretable machine reasoning, learning dynamic, transition

Genre:
- Research Report > Promising Solution (0.65)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.45)