Zero-Shot Detection via Vision and Language Knowledge Distillation