Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening
Neural Information Processing Systems
Our objective is to develop compact video representations that are sensitive to visual change over time. To measure this time-sensitivity, we introduce a new task, chiral action recognition, in which one must distinguish between a pair of temporally opposite actions, such as "opening vs. closing a door", "approaching vs. moving away from something", and "folding vs. unfolding paper". Such actions (i) occur frequently in everyday life, (ii) require understanding of simple visual change over time (in object state, size, spatial position, count . . .
Jun-19-2026, 07:37:04 GMT