Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering

Open in new window