Sequential Representation Learning via Static-Dynamic Conditional Disentanglement

Simon, Mathieu Cyrille, Frossard, Pascal, De Vleeschouwer, Christophe

Aug-10-2024–arXiv.org Artificial Intelligence

This paper explores self-supervised disentangled representation learning within sequential data, focusing on separating time-independent and time-varying factors in videos. We propose a new model that breaks the usual independence assumption between those factors by explicitly accounting for the causal relationship between the static/dynamic variables and that improves the model expressivity through additional Normalizing Flows. A formal definition of the factors is proposed. This formalism leads to the derivation of sufficient conditions for the ground truth factors to be identifiable, and to the introduction of a novel theoretically grounded disentanglement constraint that can be directly and efficiently incorporated into our new framework. The experiments show that the proposed approach outperforms previous complex state-of-the-art techniques in scenarios where the dynamics of a scene are influenced by its content.

arxiv preprint arxiv, disentanglement, representation, (14 more...)

arXiv.org Artificial Intelligence

Aug-10-2024

arXiv.org PDF

Add feedback

Country:
- Europe
  - Switzerland > Vaud
    - Lausanne (0.04)
  - Belgium > Wallonia
    - Walloon Brabant > Louvain-la-Neuve (0.04)

Genre:
- Research Report > Promising Solution (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (0.93)
  - Representation & Reasoning (0.93)
  - Machine Learning > Neural Networks (0.70)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found