Constructing Hierarchical Q&A Datasets for Video Story Understanding
Heo, Yu-Jung, On, Kyoung-Woon, Choi, Seongho, Lim, Jaeseo, Kim, Jinah, Ryu, Jeh-Kwang, Bae, Byung-Chull, Zhang, Byoung-Tak
–arXiv.org Artificial Intelligence
Video understanding is emerging as a new paradigm for studying human-like AI. Question-and-Answering (Q&A) is used as a general benchmark to measure the level of intelligence for video understanding. While several previous studies have suggested datasets for video Q&A tasks, they did not really incorporate story-level understanding, resulting in highly-biased and lack of variance in degree of question difficulty. In this paper, we propose a hierarchical method for building Q&A datasets, i.e. hierarchical difficulty levels. We introduce three criteria for video story understanding, i.e. memory capacity, logical complexity, and DIKW (Data-Information-Knowledge-Wisdom) pyramid. We discuss how three-dimensional map constructed from these criteria can be used as a metric for evaluating the levels of intelligence relating to video story understanding.
arXiv.org Artificial Intelligence
Apr-1-2019
- Country:
- Asia > South Korea
- North America > United States
- Hawaii (0.04)
- Genre:
- Research Report (0.84)
- Industry:
- Education (0.93)
- Technology: