Beyond Visual Similarity: Rule-Guided Multimodal Clustering with explicit domain rules

Open in new window