Task Attribute Distance for Few-Shot Learning: Theoretical Analysis and Applications