Interview with Xiang Fang: Multi-modal learning and embodied intelligence