Improve Cross-Architecture Generalization on Dataset Distillation