IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning

Open in new window