Unsupervised Learning of Disentangled Representations from Video

Oct-3-2024, 01:37:50 GMT–Neural Information Processing Systems

Our approach leverages the temporal coherence of video and a novel adversarial loss to learn a representation that factorizes each frame into a stationary part and a temporally varying component. The disentangled representation can be used for a range of tasks. For example, applying a standard LSTM to the time-varying components enables prediction of future frames. We evaluate our approach on a range of synthetic and real videos, demonstrating the ability to coherently generate hundreds of steps into the future.
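The prediction idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: frames are already encoded into a stationary vector and a time-varying vector, a decoder combines the two back into a frame, and the stationary part is held fixed during generation. The names `rollout`, `linear_extrapolate`, and `concat_decode` are hypothetical, and the linear extrapolation is only a toy stand-in for the LSTM mentioned above.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def rollout(
    stationary: Vector,
    varying_history: Sequence[Vector],
    predict_next: Callable[[Sequence[Vector]], Vector],
    decode: Callable[[Vector, Vector], Vector],
    steps: int,
) -> List[Vector]:
    """Generate `steps` future frames by advancing only the time-varying part."""
    history = list(varying_history)
    frames: List[Vector] = []
    for _ in range(steps):
        # Predict the next time-varying vector from everything seen so far.
        next_varying = predict_next(history)
        history.append(next_varying)
        # Assumption: the stationary part is reused unchanged at every step.
        frames.append(decode(stationary, next_varying))
    return frames


def linear_extrapolate(history: Sequence[Vector]) -> Vector:
    """Toy stand-in for a learned sequence model (NOT an LSTM)."""
    prev, last = history[-2], history[-1]
    return [2 * b - a for a, b in zip(prev, last)]


def concat_decode(stationary: Vector, varying: Vector) -> Vector:
    """Toy stand-in for a learned decoder: concatenate the two parts."""
    return stationary + varying


future = rollout(
    stationary=[1.0, 0.0],
    varying_history=[[0.0], [0.5]],
    predict_next=linear_extrapolate,
    decode=concat_decode,
    steps=3,
)
print(future)
```

In this toy setup, the first two entries of every generated frame stay constant while the last entry keeps moving, which mirrors the split between a stationary part and a temporally varying component.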

prediction, representation, video

Country:
- North America > United States
  - New York (0.04)
  - Massachusetts (0.04)
  - California > Los Angeles County
    - Long Beach (0.04)
- Asia > Middle East
  - Jordan (0.04)

Industry:
- Leisure & Entertainment (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)
