Equivariant and Invariant Grounding for Video Question Answering

Open in new window