Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling
