Supplementary Material for PointT AD: Multi-Label Temporal Action Detection with Learnable Query Points Jing Tan

Neural Information Processing Systems 

This paper proposes PointT AD, a solution with learnable query points to tackle multi-label T AD. PointT AD is the first to introduce points/keyframes for segment-level video representation.