Knowledge Transfer via Multi-Head Feature Adaptation for Whole Slide Image Classification