Coarse-to-Fine Covid-19 Segmentation via Vision-Language Alignment