Learning Distinct and Representative Modes for Image Captioning

Feb-8-2026, 11:45:47 GMT–Neural Information Processing Systems

While mode collapse is typically an unwanted side effect in generative modeling, it is somewhat "welcomed" in state-of-the-art (SoTA) image captioning models, because it usually yields higher scores on reference-based metrics such as CIDEr, BLEU, and SPICE.

artificial intelligence, caption, machine learning


Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
