Fusion and Cross-Modal Transfer for Zero-Shot Human Action Recognition