Adaptive Cross-Modal Few-shot Learning