Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning

Open in new window