Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method