
Collaborating Authors: Taboada, Maite


Dimensions of Online Conflict: Towards Modeling Agonism

arXiv.org Artificial Intelligence

Agonism plays a vital role in democratic dialogue by fostering diverse perspectives and robust discussions. Within the realm of online conflict, however, there is another type: hateful antagonism, which undermines constructive dialogue. Detecting conflict online is central to platform moderation and monetization. It is also vital for democratic dialogue, but only when it takes the form of agonism. To model these two types of conflict, we collected Twitter conversations related to trending controversial topics. We introduce a comprehensive annotation schema for labelling different dimensions of conflict in the conversations, such as the source of conflict, the target, and the rhetorical strategies deployed. Using this schema, we annotated approximately 4,000 conversations with multiple labels. We then trained both logistic regression and transformer-based models on the dataset, incorporating context from the conversation, including the number of participants and the structure of the interactions. Results show that contextual labels help identify conflict and make the models robust to variations in topic. Our research contributes a conceptualization of different dimensions of conflict, a richly annotated dataset, and promising results that can inform content moderation.
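As a rough illustration of the kind of setup the abstract describes (not the paper's actual code or data format), the sketch below combines TF-IDF text features with simple conversation-level context, such as the number of participants and reply depth, before fitting a logistic regression. The file name conversations.csv and its column names are hypothetical assumptions.

```python
# Hypothetical sketch: text features + conversation-level contextual features
# feeding a logistic regression classifier. Column names are assumptions.
import pandas as pd
from scipy.sparse import hstack, csr_matrix
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report

# Assumed columns: text, n_participants, depth, label
df = pd.read_csv("conversations.csv")

# Lexical features from the conversation text.
vectorizer = TfidfVectorizer(max_features=20_000, ngram_range=(1, 2))
X_text = vectorizer.fit_transform(df["text"])

# Contextual features: number of participants and structural depth of the thread.
X_context = csr_matrix(df[["n_participants", "depth"]].to_numpy(dtype=float))
X = hstack([X_text, X_context])

X_train, X_test, y_train, y_test = train_test_split(
    X, df["label"], test_size=0.2, random_state=42, stratify=df["label"]
)

clf = LogisticRegression(max_iter=1000, class_weight="balanced")
clf.fit(X_train, y_train)
print(classification_report(y_test, clf.predict(X_test)))
```

A transformer-based variant would replace the TF-IDF features with encoded conversation text, but the idea of appending contextual signals to the representation stays the same.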


Radar de Parité: An NLP system to measure gender representation in French news stories

arXiv.org Artificial Intelligence

We present the Radar de Parité, an automated Natural Language Processing (NLP) system that measures the proportion of women and men quoted daily in six Canadian French-language media outlets. We outline the system's architecture and detail the challenges we overcame to address French-specific issues, in particular regarding coreference resolution, a new contribution to the NLP literature on French. Our results highlight the underrepresentation of women in news stories, while also illustrating the application of modern NLP methods to measure gender representation and address societal issues.

The commonality in most applied NLP research projects is the need to reliably and scalably extract information from unstructured text data. In this paper, we describe one such application: extracting quotes from news stories to quantify gender representation. Gender representation in the media is a long-debated topic. Since the 1970s, there have been studies of how often women and gender-diverse people are portrayed in news stories, with the general hypothesis that they tend to be underrepresented [1, 2]. There is also research studying how they are represented, i.e., whether sexist or homophobic tropes are present when women and gender-diverse people are discussed [3, 4]. In this work, we tackle one specific aspect of representation: who is quoted and in what proportions. Our starting hypothesis is that we hear less from women than from men in news stories, that is, that men are quoted more often than would be expected from their proportion in the general population. To fully answer this question, we formulate a quantitative approach, collecting large amounts of representative data and extracting quotes from the unstructured text. This is the goal of the Radar de Parité. We define quotes as either direct or indirect reproductions of what a person said, and we define that person as a source in news articles. To extract quotes, we employ a full NLP pipeline, focusing on parsing to identify speakers, verbs, and quotes in each news story. We then predict the gender of the speaker (or source) using external gender-prediction services.
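The following is a minimal sketch of the kind of parse-based quote extraction described above, not the actual Radar de Parité pipeline: it pairs French reporting verbs with their grammatical subjects (speakers) and quoted content using a spaCy dependency parse. The verb list, the fr_core_news_md model choice, and the attachment heuristics are illustrative assumptions.

```python
# Hypothetical sketch of dependency-based quote extraction for French text.
# The reporting-verb list and heuristics are assumptions for illustration only.
import spacy

nlp = spacy.load("fr_core_news_md")  # assumes this French model is installed

REPORTING_VERBS = {"dire", "déclarer", "affirmer", "expliquer", "ajouter", "souligner"}

def extract_quotes(text):
    """Return (speaker, reporting verb, quoted content) triples found in the text."""
    doc = nlp(text)
    quotes = []
    for token in doc:
        if token.pos_ == "VERB" and token.lemma_ in REPORTING_VERBS:
            # Speaker: the grammatical subject of the reporting verb.
            speakers = [c for c in token.children if c.dep_ == "nsubj"]
            # Quoted content: a clausal or object dependent of the verb.
            content = [c for c in token.children if c.dep_ in ("ccomp", "obj")]
            if speakers and content:
                span = doc[content[0].left_edge.i : content[0].right_edge.i + 1]
                quotes.append((speakers[0].text, token.lemma_, span.text))
    return quotes

print(extract_quotes("« Nous devons agir », a déclaré la ministre."))
```

In a production setting, a step like this would be followed by coreference resolution (to map pronouns back to named speakers) and by gender prediction for each resolved source, as the paper describes.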