Corpus Wide Argument Mining -- a Working Solution

Ein-Dor, Liat, Shnarch, Eyal, Dankin, Lena, Halfon, Alon, Sznajder, Benjamin, Gera, Ariel, Alzate, Carlos, Gleize, Martin, Choshen, Leshem, Hou, Yufang, Bilu, Yonatan, Aharonov, Ranit, Slonim, Noam

Nov-25-2019–arXiv.org Artificial Intelligence

One of the main tasks in argument mining is the retrieval of argumentative content pertaining to a given topic. Most previous work addressed this task by retrieving a relatively small number of relevant documents as the initial source for such content. This line of research yielded moderate success, which is of limited use in a real-world system. Furthermore, for such a system to yield a comprehensive set of relevant arguments, over a wide range of topics, it requires leveraging a large and diverse corpus in an appropriate manner. Here we present a first end-to-end high-precision, corpus-wide argument mining system. This is made possible by combining sentence-level queries over an appropriate indexing of a very large corpus of newspaper articles, with an iterative annotation scheme. This scheme addresses the inherent label bias in the data and pinpoints the regions of the sample space whose manual labeling is required to obtain high-precision among top-ranked candidates. 1 Introduction Starting with the seminal work of Mochales Palau and Moens (2009), argument mining has mainly focused on the following tasks - identifying argumentative text segments within a given document; labeling these text segments according to the type of argument and its stance; and elucidating the discourse relations among the detected arguments. Typically, the considered documents were argumentative in nature, taken from a well defined domain, such as legal documents or student essays. More recently, some attention had been given to the corresponding retrieval task - given a controversial topic, retrieve arguments with a clear stance towards this topic. This is usually done by first retrieving - manually or automatically - documents relevant to the topic, and then using argument mining techniques to identify relevant argumentative segments therein. This documents-based approach was originally explored over Wikipedia (Levy et al. 2014; Rinott et al. 2015), and more recently over the entire Web (Stab et al. 2018). For an argument retrieval system to be of practical use requires: (1) high precision, and (2) wide coverage.

argument, benchmark, precision, (16 more...)

arXiv.org Artificial Intelligence

Nov-25-2019

arXiv.org PDF

Add feedback

Country:
- Oceania > Palau (0.25)
- North America > United States
  - Maryland > Baltimore (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (1.00)

Industry:
- Law (1.00)
- Media > News (0.34)
- Education > Curriculum
  - Subject-Specific Education (0.34)

Technology:
- Information Technology
  - Communications (1.00)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning
      - Inductive Learning (1.00)
      - Neural Networks (0.94)
      - Statistical Learning (0.94)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found