HAWAII: Hierarchical Visual Knowledge Transfer for Efficient Vision-Language Models