STAR: Spatial-Temporal Tracklet Matching for Multi-Object Tracking

Neural Information Processing Systems 

Existing tracking-by-detection Multi-Object Tracking methods mainly rely on associating objects with tracklets using motion and appearance features. However, variations in viewpoint and occlusions can result in discrepancies between the features of current objects and those of historical tracklets. To tackle these challenges, this paper proposes a novel Spatial-Temporal Tracklet Graph Matching paradigm (STAR). The core idea of STAR is to achieve long-term, reliable object association through the association of ``tracklet clips (TCs). TCs are segments of confidently associated multi-object trajectories, which are linked through graph matching.