Learning Distinct and Representative Modes for Image Captioning Qi Chen Chaorui Deng Qi Wu Australian Institute for Machine Learning, University of Adelaide
–Neural Information Processing Systems
However, recent findings show that the captions generated by these methods tend to be biased toward the "average" caption that only captures the most general mode ( a.k.a,
Neural Information Processing Systems
Nov-14-2025, 01:22:30 GMT