AITopics | Commonsense Reasoning

Collaborating Authors

Commonsense Reasoning

Knowledge that Everyone Knows. "People do not walk on their heads." The assertion comes about 900 statements deep into the 527,308 items that comprise the Open Mind common sense database. It's after "Laws are the rules of society" and before "The sky is blue during the day." This collection of mundane facts, which would take more than 20,000 pages to print out, consists entirely of statements so unremarkable they are barely worth stating. Most of us would correctly dismiss them as common sense.
– from D.C. Denison, Guess who's smarter. Boston Globe Online (page hosted at MIT), May 26, 2003.

News Overviews Instructional Materials AI-Alerts Classics

Does It Make Sense? And Why? A Pilot Study for Sense Making and Explanation

Wang, Cunxiang, Liang, Shuailong, Zhang, Yue, Li, Xiaonan, Gao, Tian

arXiv.org Artificial IntelligenceJun-2-2019

Introducing common sense to natural language understanding systems has received increasing research attention. It remains a fundamental question on how to evaluate whether a system has a sense making capability. Existing benchmarks measures commonsense knowledge indirectly and without explanation. In this paper, we release a benchmark to directly test whether a system can differentiate natural language statements that make sense from those that do not make sense. In addition, a system is asked to identify the most crucial reason why a statement does not make sense. We evaluate models trained over large-scale language modeling tasks as well as human performance, showing that there are different challenges for system sense making.

artificial intelligence, knowledge, natural language, (16 more...)

arXiv.org Artificial Intelligence

1906.00363

Country:

North America > United States (0.15)
Asia (0.15)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.70)

Add feedback

Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark

Nangia, Nikita, Bowman, Samuel R.

arXiv.org Artificial IntelligenceJun-1-2019

The GLUE benchmark (Wang et al., 2019b) is a suite of language understanding tasks which has seen dramatic progress in the past year, with average performance moving from 70.0 at launch to 83.9, state of the art at the time of writing (May 24, 2019). Here, we measure human performance on the benchmark, in order to learn whether significant headroom remains for further progress. We provide a conservative estimate of human performance on the benchmark through crowdsourcing: Our annotators are non-experts who must learn each task from a brief set of instructions and 20 examples. In spite of limited training, these annotators robustly outperform the state of the art on six of the nine GLUE tasks and achieve an average score of 87.1. Given the fast pace of progress however, the headroom we observe is quite limited. To reproduce the data-poor setting that our annotators must learn in, we also train the BERT model (Devlin et al., 2019) in limited-data regimes, and conclude that low-resource sentence classification remains a challenge for modern neural network approaches to text understanding.

annotator, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

1905.10425

Country: North America > United States (0.47)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

Add feedback

Attention Is (not) All You Need for Commonsense Reasoning

Klein, Tassilo, Nabi, Moin

arXiv.org Artificial IntelligenceMay-31-2019

The recently introduced BERT model exhibits strong performance on several language understanding benchmarks. In this paper, we describe a simple re-implementation of BERT for commonsense reasoning. We show that the attentions produced by BERT can be directly utilized for tasks such as the Pronoun Disambiguation Problem and Winograd Schema Challenge. Our proposed attention-guided commonsense reasoning method is conceptually simple yet empirically powerful. Experimental analysis on multiple datasets demonstrates that our proposed system performs remarkably well on all cases while outperforming the previously reported state of the art by a margin. While results suggest that BERT seems to implicitly learn to establish complex relationships between entities, solving commonsense reasoning tasks might require more than unsupervised models learned from huge text corpora.

artificial intelligence, commonsense reasoning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

1905.13497

Country: Europe > Belgium (0.15)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Commonsense Properties from Query Logs and Question Answering Forums

Romero, Julien, Razniewski, Simon, Pal, Koninika, Pan, Jeff Z., Sakhadeo, Archit, Weikum, Gerhard

arXiv.org Artificial IntelligenceMay-31-2019

Commonsense knowledge about object properties, human behavior and general concepts is crucial for robust AI applications. However, automatic acquisition of this knowledge is challenging because of sparseness and bias in online sources. This paper presents Quasimodo, a methodology and tool suite for distilling commonsense properties from non-standard web sources. We devise novel ways of tapping into search-engine query logs and QA forums, and combining the resulting candidate assertions with statistical cues from encyclopedias, books and image tags in a corroboration step. Unlike prior work on commonsense knowledge bases, Quasimodo focuses on salient properties that are typically associated with certain objects or concepts. Extensive evaluations, including extrinsic use-case studies, show that Quasimodo provides better coverage than state-of-the-art baselines with comparable quality.

assertion, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

1905.10989

Genre: Research Report > Experimental Study (0.68)

Industry:

Education (1.00)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.88)
(2 more...)

Add feedback

On the Winograd Schema: Situating Language Understanding in the Data-Information-Knowledge Continuum

Saba, Walid (Astound.ai)

AAAI ConferencesMay-15-2019

The Winograd Schema (WS) challenge has been proposed as an alternative to the Turing Test as a test for machine intelligence. In this paper we ‘situate’ the WS challenge in the data-information-knowledge continuum, suggesting in the process what a good WS is. Subsequently, we will argue that the WS is but a special case of a more general phenomenon in language understanding, namely the phenomenon of the ‘missing text’. In particular, we will argue that what we usually call thinking in the process of language understanding almost always involves discovering some missing text - text is rarely explicitly stated but is implicitly assumed as shared background knowledge. As such, we suggest extending the WS challenge to include other linguistic phenomena that also involve discovering the ‘missing text’, such tests metonymy, quantifier scope, lexical disambiguation, and copredication, to name a few.

background knowledge, knowledge, winograd schema, (15 more...)

AAAI Conferences

The Thirty-Second International Flairs Conference

Country:

North America > United States > New York (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)
Europe > Czechia > South Moravian Region > Brno (0.04)

Industry:

Leisure & Entertainment (0.68)
Media (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Issues > Turing's Test (0.68)

Add feedback

SP-10K: A Large-scale Evaluation Set for Selectional Preference Acquisition

Zhang, Hongming, Ding, Hantian, Song, Yangqiu

arXiv.org Artificial IntelligenceMay-14-2019

Selectional Preference (SP) is a commonly observed language phenomenon and proved to be useful in many natural language processing tasks. To provide a better evaluation method for SP models, we introduce SP-10K, a large-scale evaluation set that provides human ratings for the plausibility of 10,000 SP pairs over five SP relations, covering 2,500 most frequent verbs, nouns, and adjectives in American English. Three representative SP acquisition methods based on pseudo-disambiguation are evaluated with SP-10K. To demonstrate the importance of our dataset, we investigate the relationship between SP-10K and the commonsense knowledge in ConceptNet5 and show the potential of using SP to represent the commonsense knowledge. We also use the Winograd Schema Challenge to prove that the proposed new SP relations are essential for the hard pronoun coreference resolution problem.

artificial intelligence, natural language, relation, (17 more...)

arXiv.org Artificial Intelligence

1906.02123

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

ASER: A Large-scale Eventuality Knowledge Graph

Zhang, Hongming, Liu, Xin, Pan, Haojie, Song, Yangqiu, Wing-Ki, Cane, Leung, null

arXiv.org Artificial IntelligenceMay-1-2019

Understanding human's language requires complex world knowledge. However, existing large-scale knowledge graphs mainly focus on knowledge about entities while ignoring knowledge about activities, states, or events, which are used to describe how entities or things act in the real world. To fill this gap, we develop ASER (activities, states, events, and their relations), a large-scale eventuality knowledge graph extracted from more than 11-billion-token unstructured textual data. ASER contains 15 relation types belonging to five categories, 194-million unique eventualities, and 64-million unique edges among them. Both human and extrinsic evaluations demonstrate the quality and effectiveness of ASER.

artificial intelligence, machine learning, natural language, (23 more...)

arXiv.org Artificial Intelligence

1905.0027

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.95)
(3 more...)

Add feedback

Enabling Robots to Understand Incomplete Natural Language Instructions Using Commonsense Reasoning

Chen, Haonan, Tan, Hao, Kuntz, Alan, Bansal, Mohit, Alterovitz, Ron

arXiv.org Artificial IntelligenceApr-29-2019

Enabling robots to understand instructions provided via spoken natural language would facilitate interaction between robots and people in a variety of settings in homes and workplaces. However, natural language instructions are often missing information that would be obvious to a human based on environmental context and common sense, and hence does not need to be explicitly stated. In this paper, we introduce Language-Model-based Commonsense Reasoning (LMCR), a new method which enables a robot to listen to a natural language instruction from a human, observe the environment around it, and automatically fill in information missing from the instruction using environmental context and a new commonsense reasoning approach. Our approach first converts an instruction provided as unconstrained natural language into a form that a robot can understand by parsing it into verb frames. Our approach then fills in missing information in the instruction by observing objects in its vicinity and leveraging commonsense reasoning. To learn commonsense reasoning automatically, our approach distills knowledge from large unstructured textual corpora by training a language model. Our results show the feasibility of a robot learning commonsense knowledge automatically from web-based textual corpora, and the power of learned commonsense reasoning models in enabling a robot to autonomously perform tasks based on incomplete natural language instructions.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

1904.12907

Country:

North America > United States > Colorado (0.04)
North America > United States > Pennsylvania (0.04)
North America > United States > North Carolina > Orange County > Chapel Hill (0.04)
(3 more...)

Genre: Research Report > New Finding (0.86)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Exploring Unsupervised Pretraining and Sentence Structure Modelling for Winograd Schema Challenge

Ruan, Yu-Ping, Zhu, Xiaodan, Ling, Zhen-Hua, Shi, Zhan, Liu, Quan, Wei, Si

arXiv.org Artificial IntelligenceApr-21-2019

Winograd Schema Challenge (WSC) was proposed as an AI-hard problem in testing computers' intelligence on common sense representation and reasoning. This paper presents the new state-of-theart on WSC, achieving an accuracy of 71.1%. We demonstrate that the leading performance benefits from jointly modelling sentence structures, utilizing knowledge learned from cutting-edge pretraining models, and performing fine-tuning. We conduct detailed analyses, showing that fine-tuning is critical for achieving the performance, but it helps more on the simpler associative problems. Modelling sentence dependency structures, however, consistently helps on the harder non-associative subset of WSC. Analysis also shows that larger fine-tuning datasets yield better performances, suggesting the potential benefit of future work on annotating more Winograd schema sentences.

artificial intelligence, commonsense reasoning, wsc, (14 more...)

arXiv.org Artificial Intelligence

1904.09705

Country:

North America (0.28)
Asia > China (0.14)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)

Add feedback

Artificial intelligence learns 'deep thoughts' by playing Pictionary

The Independent - TechFeb-5-2019, 20:40:00 GMT

Scientists are using the popular drawing game Pictionary to teach artificial intelligence common sense. AI researchers at the Allen Institute for Artificial Intelligence (AI2), a non-profit lab in Seattle, developed a version of the game called Iconary in order to teach its AllenAI artificial intelligence abstract concepts from pictures alone. Iconary was made public on 5 February in order to encourage people to play the game with AllenAI. By learning from humans, the researchers hope AllenAI will continue to develop common sense reasoning. "Iconary is one of the first times an AI system is paired in a collaborative game with a human player instead of antagonistically working against them," the Iconary website states.

large language model, machine learning, natural language, (17 more...)

The Independent - Tech

Country: North America > United States (0.84)

Industry:

Government > Military (0.86)
Government > Regional Government > North America Government > United States Government (0.84)
Leisure & Entertainment > Games > Chess (0.57)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.56)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback