Towards Video Text Visual Question Answering: Benchmark and Baseline

Dec-25-2025, 14:08:07 GMT–Neural Information Processing Systems

There are already some text-based visual question answering (TextVQA) benchmarks for developing machine's ability to answer questions based on texts in images in recent years. However, models developed on these benchmarks cannot work effectively in many real-life scenarios (e.g.

benchmark and baseline, name change, proceedings, (4 more...)

Neural Information Processing Systems

Dec-25-2025, 14:08:07 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.44)