Diverse Image Captioning with Context-Object Split Latent Spaces

Oct-2-2025, 11:57:39 GMT–Neural Information Processing Systems

Figure 1: Context-object split latent space of our COS-CV AE to exploit similarities in the contextual annotations for diverse captioning.

caption, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Oct-2-2025, 11:57:39 GMT

Conferences PDF

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Representation & Reasoning (0.69)
  - Machine Learning > Neural Networks
    - Deep Learning (0.50)

Duplicate Docs Excel Report

Title
24bea84d52e6a1f8025e313c2ffff50a-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found