Interacted Object Grounding in Spatio-Temporal Human-Object Interactions