Supplementary Material for CLEVRER-Humans: Describing Physical and Causal Events the Human Way Jiayuan Mao MIT Xuelin Y ang

Neural Information Processing Systems 

We bear all responsibility in case of violation of rights. The rest of this supplementary document is organized as the following. Next, in Section C, we describe the user interface for dataset collection. On average, we can obtain 29.4 descriptions per video, highlighting the advantage of our First, CLEVRER-Humans contains dense annotations of causal relations between physical events. The outer circle represents the general event families. We have lemmatized all verbs to remove the tense.