SmartCLIP: Modular Vision-language Alignment with Identification Guarantees

Open in new window