Efficient Large-scale Audio Tagging via Transformer-to-CNN Knowledge Distillation