Self-supervised Extraction of Human Motion Structures via Frame-wise Discrete Features

Open in new window