CLIP Model for Images to Textual Prompts Based on Top-k Neighbors