SmartCLIP: Modular Vision-language Alignment with Identification Guarantees - Databubble