Cross-Genre Argument Mining: Can Language Models Automatically Fill in Missing Discourse Markers?
Rocha, Gil; Cardoso, Henrique Lopes; Belouadi, Jonas; Eger, Steffen
arXiv.org Artificial Intelligence
Available corpora for Argument Mining differ along several axes, and one of the key differences is the presence (or absence) of discourse markers to signal argumentative content. Effective ways of using discourse markers have received wide attention in various discourse parsing tasks, where they are well known to be strong indicators of discourse relations. To improve the robustness of Argument Mining systems across different genres, we propose to automatically augment a given text with discourse markers such that all relations are explicitly signaled. Our analysis reveals that popular language models taken out-of-the-box fail on this task; however, when fine-tuned on a new heterogeneous dataset that we construct (including synthetic and real examples), they perform considerably better. We demonstrate the impact of our approach on an Argument Mining downstream task, evaluated on different corpora, showing that language models can be trained to automatically fill in discourse markers across different corpora, improving the performance of a downstream model in some, but not all, cases. Our proposed approach can further be employed as an assistive tool for better discourse understanding.
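The augmentation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the candidate marker list, the function names `augment_with_marker` and `toy_score`, and the keyword-based scorer are all assumptions made for illustration. In practice the scorer would be a (fine-tuned) language model rating how plausible each augmented text is.

```python
from typing import Callable, Sequence

# Hypothetical candidate inventory; the paper's actual marker set is not listed here.
CANDIDATE_MARKERS = ("Therefore", "However", "Moreover", "For example")


def augment_with_marker(
    previous: str,
    unit: str,
    score: Callable[[str], float],
    markers: Sequence[str] = CANDIDATE_MARKERS,
) -> str:
    """Prefix `unit` with the marker that `score` rates highest in context."""
    if not unit:
        return unit

    def with_marker(marker: str) -> str:
        # Lowercasing the first letter is a simplification; it would be
        # wrong for units that start with a proper noun.
        return f"{marker}, {unit[0].lower()}{unit[1:]}"

    # Score each candidate on the full two-sentence context and keep the best.
    best = max(markers, key=lambda m: score(f"{previous} {with_marker(m)}"))
    return with_marker(best)


def toy_score(text: str) -> float:
    """Stand-in for a language-model score (assumption, keyword-based only)."""
    tokens = text.lower().split()
    has_contrast = any(cue in tokens for cue in ("but", "not", "too"))
    if "However," in text:
        return 1.0 if has_contrast else 0.0
    if "Therefore," in text:
        return 0.5
    return 0.1


if __name__ == "__main__":
    print(augment_with_marker(
        "Public transport is convenient.", "The fares are too high.", toy_score
    ))
```

Passing the scorer in as a parameter keeps the insertion logic independent of any particular model, so the toy heuristic can be swapped for a real language-model score without changing `augment_with_marker`.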
Jun-7-2023
- Country:
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- New York City (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Colorado > Denver County
- Denver (0.04)
- Canada > British Columbia
- Europe
- Germany > Berlin (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Italy
- Tuscany > Florence (0.05)
- Umbria > Perugia Province
- Perugia (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Netherlands > South Holland
- Dordrecht (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Portugal
- Asia
- China > Hong Kong (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- Africa > Middle East
- Morocco (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Law (0.93)
- Technology: