

Supplementary Material for CLEVRER-Humans: Describing Physical and Causal Events the Human Way

Jiayuan Mao (MIT), Xuelin Yang

Neural Information Processing Systems

We bear all responsibility in case of violation of rights. The rest of this supplementary document is organized as follows. In Section C, we describe the user interface for dataset collection. On average, we obtain 29.4 descriptions per video, highlighting the advantage of our collection approach. First, CLEVRER-Humans contains dense annotations of causal relations between physical events. The outer circle represents the general event families. All verbs have been lemmatized to remove tense.
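The note above mentions lemmatizing verbs to remove tense when grouping event descriptions into families. A toy sketch of what that normalization does (the actual pipeline would use a dictionary-backed NLP lemmatizer; the suffix rules and exception table here are illustrative assumptions, not the paper's method):

```python
# Toy verb lemmatizer: maps tensed verb forms to a base lemma so that
# "hits", "hit", and "hitting" all count toward the same event family.
# Real systems use dictionary-backed lemmatizers; these rules are a sketch.
EXCEPTIONS = {"hit": "hit", "hits": "hit", "fell": "fall", "collided": "collide"}

def lemmatize_verb(verb: str) -> str:
    v = verb.lower()
    if v in EXCEPTIONS:          # irregular forms need a lookup table
        return EXCEPTIONS[v]
    if v.endswith("ies"):
        return v[:-3] + "y"      # "carries" -> "carry"
    if v.endswith("es"):
        return v[:-2]            # "pushes" -> "push"
    if v.endswith("ed"):
        return v[:-2]            # "bumped" -> "bump"
    if v.endswith("ing"):
        return v[:-3]            # "rolling" -> "roll"
    if v.endswith("s"):
        return v[:-1]            # "rolls" -> "roll"
    return v

print([lemmatize_verb(w) for w in ["hits", "bumped", "collided", "rolls"]])
# → ['hit', 'bump', 'collide', 'roll']
```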



CLEVRER-Humans: Describing Physical and Causal Events the Human Way

Neural Information Processing Systems

Building machines that can reason about physical events and their causal relationships is crucial for flexible interaction with the physical world. However, most existing physical and causal reasoning benchmarks are based exclusively on synthetically generated events and synthetic natural language descriptions of causal relationships. This design raises two issues: first, there is a lack of diversity in both event types and natural language descriptions; second, causal relationships based on manually defined heuristics differ from human judgments. To address both shortcomings, we present the CLEVRER-Humans benchmark, a video reasoning dataset for causal judgment of physical events with human labels. We employ two techniques to improve data collection efficiency: first, a novel iterative event cloze task to elicit a new representation of events in videos, which we term Causal Event Graphs (CEGs); second, a data augmentation technique based on neural language generative models. We convert the collected CEGs into questions and answers for consistency with prior work. Finally, we study a collection of baseline approaches for CLEVRER-Humans question answering, highlighting the great challenges set forth by our benchmark.
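The abstract describes Causal Event Graphs and their conversion into question-answer pairs. A minimal sketch of what such a structure might look like, assuming a simple graph with natural-language event nodes, human-labeled causal edges, and a templated yes/no question format (the class, field names, and question template are hypothetical illustrations, not the paper's actual format):

```python
# Sketch of a Causal Event Graph (CEG): nodes are natural-language event
# descriptions; directed edges carry a human judgment of whether the
# source event causes the target event. Edges become yes/no QA pairs.
from dataclasses import dataclass, field

@dataclass
class CausalEventGraph:
    events: list = field(default_factory=list)   # event descriptions
    edges: dict = field(default_factory=dict)    # (src, dst) -> bool (causal?)

    def add_event(self, description: str) -> int:
        self.events.append(description)
        return len(self.events) - 1

    def label_edge(self, src: int, dst: int, causal: bool) -> None:
        self.edges[(src, dst)] = causal

    def to_qa_pairs(self):
        """Turn each labeled edge into a templated yes/no question."""
        for (src, dst), causal in self.edges.items():
            question = (f"Is the fact that {self.events[src]} responsible "
                        f"for the fact that {self.events[dst]}?")
            yield question, ("yes" if causal else "no")

ceg = CausalEventGraph()
a = ceg.add_event("the red cube hits the metal sphere")
b = ceg.add_event("the metal sphere collides with the green cylinder")
ceg.label_edge(a, b, causal=True)
for question, answer in ceg.to_qa_pairs():
    print(question, "->", answer)
```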







CLEVRER-Humans: Describing Physical and Causal Events the Human Way

Mao, Jiayuan, Yang, Xuelin, Zhang, Xikun, Goodman, Noah D., Wu, Jiajun

arXiv.org Machine Learning
