Adaptively Aligned Image Captioning via Adaptive Attention Time

Lun Huang, Wenmin Wang, Yaxian Xia, Jie Chen

Feb-15-2026, 10:09:11 GMT–Neural Information Processing Systems

AATallowstheframeworktolearn howmany attention steps to take to output a caption word at each decoding step. With AAT, an image region can be mapped to an arbitrary number of caption words while a caption word can also attend to an arbitrary number of image regions. AAT is deterministic and differentiable, and doesn't introduce any noise to the parameter gradients.

artificial intelligence, attention step, machine learning, (15 more...)

Neural Information Processing Systems

Feb-15-2026, 10:09:11 GMT

Conferences PDF

Add feedback

Country:
- North America > Canada
  - British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Asia > China
  - Guangdong Province > Shenzhen (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.32)

Duplicate Docs Excel Report

Title
Adaptively Aligned Image Captioning via Adaptive Attention Time

Similar Docs Excel Report more

Title	Similarity	Source
None found