Belief Revision based Caption Re-ranker with Visual Semantic Information
Sabir, Ahmed, Moreno-Noguer, Francesc, Madhyastha, Pranava, Padró, Lluís
–arXiv.org Artificial Intelligence
In this work, we focus on improving the captions generated by image-caption generation systems. We propose a novel re-ranking approach that leverages visual-semantic measures to identify the ideal caption that maximally captures the visual information in the image. Our re-ranker utilizes the Belief Revision framework (Blok et al., 2003) to calibrate the original likelihood of the top-n captions by explicitly exploiting the semantic relatedness between the depicted caption and the visual context. Our experiments demonstrate the utility of our approach, where we observe that our re-ranker can enhance the performance of a typical image-captioning system without the necessity of any additional training or fine-tuning.
arXiv.org Artificial Intelligence
Sep-16-2022
- Country:
- Europe
- United Kingdom > England
- Greater London > London (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- Asia > China
- Heilongjiang Province > Daqing (0.04)
- Europe
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Transportation > Ground
- Road (0.46)
- Leisure & Entertainment > Sports
- Tennis (0.46)
- Transportation > Ground
- Technology: