Unsupervised Motion Representation Learning with Capsule Autoencoders

Oct-9-2024, 16:08:21 GMT–Neural Information Processing Systems

We propose the Motion Capsule Autoencoder (MCAE), which addresses a key challenge in the unsupervised learning of motion representations: transformation invariance. In the lower level, a spatio-temporal motion signal is divided into short, local, and semantic-agnostic snippets. In the higher level, the snippets are aggregated to form full-length semantic-aware segments. For both levels, we represent motion with a set of learned transformation invariant templates and the corresponding geometric transformations by using capsule autoencoders of a novel design. This leads to a robust and efficient encoding of viewpoint changes.

capsule autoencoder, snippet, unsupervised motion representation learning, (1 more...)

Neural Information Processing Systems

Oct-9-2024, 16:08:21 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.92)