OPEL: Optimal Transport Guided ProcedurE Learning
–Neural Information Processing Systems
Procedure learning refers to the task of identifying the key-steps and determining their logical order, given several videos of the same task. For both third-person and first-person (egocentric) videos, state-of-the-art (SOTA) methods aim at finding correspondences across videos in time to accomplish procedure learning. However, to establish temporal relationships within the sequences, these methods often rely on frame-to-frame mapping, or assume monotonic alignment of video pairs, leading to sub-optimal results. To this end, we propose to treat the video frames as samples from an unknown distribution, enabling us to frame their distance calculation as an optimal transport (OT) problem. Notably, the OTbased formulation allows us to relax the previously mentioned assumptions.
Neural Information Processing Systems
May-29-2025, 22:38:19 GMT
- Country:
- North America > United States > Indiana > Tippecanoe County (0.14)
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (0.67)
- Workflow (0.67)
- Research Report
- Industry:
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Neural Networks (0.68)
- Statistical Learning (0.46)
- Natural Language (1.00)
- Representation & Reasoning (1.00)
- Vision (1.00)
- Machine Learning
- Data Science (0.93)
- Artificial Intelligence
- Information Technology