Towards Video Text Visual Question Answering: Benchmark and Baseline

Neural Information Processing Systems 

As mentioned in our paper, M4-ViteVQA has 9 categories.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found