Reviews: Latent Alignment and Variational Attention

Oct-8-2024, 01:41:46 GMT–Neural Information Processing Systems

Update based on author rebuttal: I believe the authors have addressed the main criticisms of this paper (not clear how it's different from prior work) and have also provided additional experiments. I've raised my score accordingly. This paper focuses on using variational inference to train models with a "latent" (stochastic) attention mechanism. They consider attention as a categorical or dirichlet random variable, and explore using posterior inference on alignment decisions. They test various approaches on NMT and visual question answering datasets.

attention-based model, domain structure, latent alignment and variational attention, (6 more...)

Neural Information Processing Systems

Oct-8-2024, 01:41:46 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.53)