UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations