CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks