Talat, Zeerak
The Only Way is Ethics: A Guide to Ethical Research with Large Language Models
Ungless, Eddie L., Vitsakis, Nikolas, Talat, Zeerak, Garforth, James, Ross, Björn, Onken, Arno, Kasirzadeh, Atoosa, Birch, Alexandra
There is a significant body of work looking at the ethical considerations of large language models (LLMs): critiquing tools to measure performance and harms; proposing toolkits to aid in ideation; discussing the risks to workers; considering legislation around privacy and security, etc. As yet, there is no work that integrates these resources into a single practical guide focused on LLMs; we attempt this ambitious goal. We introduce the 'LLM Ethics Whitepaper', which we provide as an open and living resource for NLP practitioners and those tasked with evaluating the ethical implications of others' work. Our goal is to translate the ethics literature into concrete recommendations and provocations for thinking, with clear first steps, aimed at computer scientists. The 'LLM Ethics Whitepaper' distils a thorough literature review into clear Do's and Don'ts, which we also present in this paper. We likewise identify useful toolkits to support ethical work. We refer the interested reader to the full LLM Ethics Whitepaper, which provides a succinct discussion of ethical considerations at each stage of a project lifecycle, as well as citations for the hundreds of papers from which we drew our recommendations. The present paper can be thought of as a pocket guide to conducting ethical research with LLMs.
A Capabilities Approach to Studying Bias and Harm in Language Technologies
Nigatu, Hellina Hailu, Talat, Zeerak
In moving from excluding the majority of the world's languages to blindly adopting what we make for English, we risk importing the same harms that, for English, we have at best mitigated and at least measured. For instance, Yong et al. [15] showed how prompting GPT-4 in low-resource languages circumvents guardrails that are effective in English. However, in evaluating and mitigating harms arising from adopting new technologies into such contexts, we often disregard (1) the actual needs of communities for Language Technologies, and (2) biases and fairness issues within the context of those communities. Here, we consider fairness, bias, and inclusion in Language Technologies through the lens of the Capabilities Approach [12]. The Capabilities Approach, a framework from development economics proposed by Amartya Sen in a series of articles dating back to 1974 [1] and since applied to varied fields including environmental justice, centers what people are capable of achieving, given their intersectional social, political, and economic contexts, rather than what resources are (theoretically) available to them. We detail the Capabilities Approach, its relationship to multilingual and multicultural evaluation, and how the framework affords meaningful collaboration with community members in defining and measuring the harms of Language Technologies.
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models
Ungless, Eddie L., Vitsakis, Nikolas, Talat, Zeerak, Garforth, James, Ross, Björn, Onken, Arno, Kasirzadeh, Atoosa, Birch, Alexandra
This whitepaper offers an overview of the ethical considerations surrounding research into or with large language models (LLMs). As LLMs become more integrated into widely used applications, their societal impact increases, bringing important ethical questions to the forefront. Drawing on a growing body of work examining the ethical development, deployment, and use of LLMs, this whitepaper provides a comprehensive and practical guide to best practices, designed to help those in research and industry uphold the highest ethical standards in their work.
Understanding "Democratization" in NLP and ML Research
Subramonian, Arjun, Gautam, Vagrant, Klakow, Dietrich, Talat, Zeerak
Recent improvements in natural language processing (NLP) and machine learning (ML), together with increased mainstream adoption, have led researchers to frequently discuss the "democratization" of artificial intelligence. In this paper, we seek to clarify how democratization is understood in NLP and ML publications, through large-scale mixed-methods analyses of papers using the keyword "democra*" published in NLP and adjacent venues. We find that democratization is most frequently used to convey (ease of) access to or use of technologies, without meaningfully engaging with theories of democratization, while research using other invocations of "democra*" tends to be grounded in theories of deliberation and debate. Based on our findings, we call on researchers to enrich their use of the term democratization with appropriate theory, working towards democratic technologies that go beyond superficial access.
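The keyword-screening step described in this abstract can be illustrated with a simple pattern match. The sketch below is not the authors' actual pipeline; the paper records and matching rule are hypothetical, showing only how papers mentioning "democra*" might be flagged for further qualitative analysis.

```python
import re

# Hypothetical corpus of paper records: (title, abstract) pairs.
papers = [
    ("Open Models for All", "We democratize access to large models ..."),
    ("Deliberation in ML", "We study democratic deliberation about datasets ..."),
    ("Faster Attention", "We propose a faster attention mechanism ..."),
]

# Match any token beginning with "democra" (democratize, democratic, democracy, ...).
pattern = re.compile(r"\bdemocra\w*", re.IGNORECASE)

# Keep papers whose title or abstract mentions the keyword, with the matched forms.
hits = []
for title, abstract in papers:
    matches = pattern.findall(title + " " + abstract)
    if matches:
        hits.append((title, sorted({m.lower() for m in matches})))

for title, forms in hits:
    print(f"{title}: {forms}")
```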
Exploring the Limitations of Detecting Machine-Generated Text
Doughman, Jad, Afzal, Osama Mohammed, Toyin, Hawau Olamide, Shehata, Shady, Nakov, Preslav, Talat, Zeerak
Recent improvements in the quality of text generated by large language models have spurred research into identifying machine-generated text. Systems proposed for the task often achieve high performance. However, humans and machines can produce text in different styles and in different domains, and it remains unclear whether machine-generated text detection models favour particular styles or domains. In this paper, we critically examine classification performance for detecting machine-generated text by evaluating on texts with varying writing styles. We find that classifiers are highly sensitive to stylistic changes and differences in text complexity, and in some cases degrade entirely to random classifiers. We further find that detection systems are particularly prone to misclassifying easy-to-read texts, while they perform well on complex texts.
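As a rough illustration of the kind of stratified evaluation described here, the sketch below buckets an evaluation set by a crude complexity proxy (mean word length) and reports detector accuracy per bucket. The detector, the proxy, and the example data are stand-ins, not the systems or measures used in the paper.

```python
from statistics import mean

def avg_word_length(text: str) -> float:
    """Crude proxy for text complexity: mean word length in characters."""
    words = text.split()
    return mean(len(w) for w in words) if words else 0.0

def detector(text: str) -> int:
    """Stand-in for a machine-generated-text classifier: 1 = machine, 0 = human."""
    # Placeholder heuristic; a real system would be a trained model.
    return int(avg_word_length(text) > 5.0)

# Hypothetical evaluation set: (text, true_label) with 1 = machine-generated.
eval_set = [
    ("The cat sat on the mat and looked out of the window.", 0),
    ("Subsequent experimentation corroborated the preliminary hypothesis.", 1),
    ("I think the movie was fun but a bit too long for me.", 0),
    ("The methodology demonstrates considerable generalisation capability.", 1),
]

# Stratify accuracy by the complexity proxy to expose sensitivity to style/readability.
buckets = {"simple": [], "complex": []}
for text, label in eval_set:
    bucket = "complex" if avg_word_length(text) > 5.0 else "simple"
    buckets[bucket].append(detector(text) == label)

for name, outcomes in buckets.items():
    if outcomes:
        print(f"{name}: accuracy = {sum(outcomes) / len(outcomes):.2f} (n={len(outcomes)})")
```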
The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels
Fleisig, Eve, Blodgett, Su Lin, Klein, Dan, Talat, Zeerak
Longstanding data labeling practices in machine learning involve collecting and aggregating labels from multiple annotators. But what should we do when annotators disagree? Though annotator disagreement has long been seen as a problem to minimize, new perspectivist approaches challenge this assumption by treating disagreement as a valuable source of information. In this position paper, we examine practices and assumptions surrounding the causes of disagreement--some challenged by perspectivist approaches, and some that remain to be addressed--as well as practical and normative challenges for work operating under these assumptions. We conclude with recommendations for the data labeling pipeline and avenues for future research engaging with subjectivity and disagreement.
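To make the contrast concrete, the following sketch shows traditional majority-vote aggregation alongside a simple perspectivist alternative that retains the full distribution of annotator labels; the items and labels are hypothetical.

```python
from collections import Counter

# Hypothetical annotations: item id -> list of labels from individual annotators.
annotations = {
    "post_1": ["offensive", "offensive", "not_offensive"],
    "post_2": ["not_offensive", "offensive", "not_offensive", "not_offensive"],
}

def majority_vote(labels):
    """Traditional aggregation: collapse disagreement into a single label."""
    return Counter(labels).most_common(1)[0][0]

def label_distribution(labels):
    """Perspectivist alternative: keep the distribution over annotator labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.items()}

for item, labels in annotations.items():
    print(item, majority_vote(labels), label_distribution(labels))
```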
Classist Tools: Social Class Correlates with Performance in NLP
Curry, Amanda Cercas, Attanasio, Giuseppe, Talat, Zeerak, Hovy, Dirk
Since William Labov's foundational work on the social stratification of language (Labov, 1964), linguistics has made concerted efforts to explore the links between socio-demographic characteristics and language production and perception. But while there is strong evidence that socio-demographic characteristics are reflected in language, they are infrequently used in Natural Language Processing (NLP). Age and gender are somewhat well represented, but Labov's original target, socio-economic status, is noticeably absent. And yet it matters. We show empirically that NLP disadvantages less-privileged socio-economic groups. We annotate a corpus of 95K utterances from movies with social class, ethnicity, and geographical language variety, and measure the performance of NLP systems on three tasks: language modelling, automatic speech recognition, and grammar error correction. We find significant performance disparities that can be attributed to socio-economic status, as well as to ethnicity and geographical differences. With NLP technologies becoming ever more ubiquitous and quotidian, they must accommodate all language varieties to avoid disadvantaging already marginalised groups. We argue for the inclusion of socio-economic class in future language technologies.
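A disparity analysis of the kind described can be sketched as follows: per-utterance errors are grouped by an annotated social-class label and error rates are compared across groups. The data, group names, and error indicator are illustrative placeholders, not the paper's corpus or metrics.

```python
from collections import defaultdict

# Hypothetical per-utterance results: (social_class, model_error) where
# model_error is 1 if the system got the utterance wrong, 0 otherwise.
results = [
    ("upper", 0), ("upper", 0), ("upper", 1),
    ("middle", 0), ("middle", 1), ("middle", 0),
    ("working", 1), ("working", 1), ("working", 0),
]

# Aggregate error rates per group to surface performance disparities.
errors = defaultdict(list)
for group, error in results:
    errors[group].append(error)

rates = {group: sum(errs) / len(errs) for group, errs in errors.items()}
gap = max(rates.values()) - min(rates.values())

for group, rate in sorted(rates.items(), key=lambda kv: kv[1]):
    print(f"{group}: error rate = {rate:.2f}")
print(f"largest inter-group gap: {gap:.2f}")
```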
Impoverished Language Technology: The Lack of (Social) Class in NLP
Curry, Amanda Cercas, Talat, Zeerak, Hovy, Dirk
Since Labov's (1964) foundational work on the social stratification of language, linguistics has dedicated concerted efforts towards understanding the relationships between socio-demographic factors and language production and perception. Despite the large body of evidence identifying significant relationships between socio-demographic factors and language production, relatively few of these factors have been investigated in the context of NLP technology. While age and gender are well covered, Labov's initial target, socio-economic class, is largely absent. We survey the existing Natural Language Processing (NLP) literature and find that only 20 papers even mention socio-economic status. However, the majority of those papers do not engage with class beyond collecting information on annotator demographics. Given this research lacuna, we provide a definition of class that can be operationalised by NLP researchers, and argue for including socio-economic class in future language technologies.
Subjective Isms? On the Danger of Conflating Hate and Offence in Abusive Language Detection
Curry, Amanda Cercas, Abercrombie, Gavin, Talat, Zeerak
Natural language processing research has begun to embrace the notion of annotator subjectivity, motivated by variations in labelling. This approach understands each annotator's view as valid, which can be highly suitable for tasks that embed subjectivity, e.g., sentiment analysis. However, this construction may be inappropriate for tasks such as hate speech detection, as it affords equal validity to all positions on e.g., sexism or racism. We argue that the conflation of hate and offence can invalidate findings on hate speech, and call for future work to be situated in theory, disentangling hate from its orthogonal concept, offence.
Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon
Koto, Fajri, Beck, Tilman, Talat, Zeerak, Gurevych, Iryna, Baldwin, Timothy
Improving the capabilities of multilingual language models in low-resource languages is generally difficult due to the scarcity of large-scale data in those languages. In this paper, we relax the reliance on texts in low-resource languages by using multilingual lexicons during pretraining to enhance multilingual capabilities. Specifically, we focus on zero-shot sentiment analysis tasks across 34 languages, including 6 high/medium-resource languages, 25 low-resource languages, and 3 code-switching datasets. We demonstrate that pretraining with multilingual lexicons, without using any sentence-level sentiment data, achieves superior zero-shot performance compared to models fine-tuned on English sentiment datasets, and to large language models such as GPT-3.5, BLOOMZ, and XGLM. These findings hold for unseen low-resource languages as well as for code-mixed scenarios involving high-resource languages.
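The sketch below is not the paper's pretraining recipe; it only illustrates, with a toy lexicon, how a multilingual sentiment lexicon can support language-agnostic, zero-shot sentiment labelling without any sentence-level supervision. The lexicon entries and example inputs are hypothetical.

```python
# Toy multilingual sentiment lexicon: word -> polarity (+1 positive, -1 negative).
# Entries are illustrative; real lexicons cover many more languages and words.
lexicon = {
    "good": 1, "bad": -1,      # English
    "bagus": 1, "buruk": -1,   # Indonesian
    "bueno": 1, "malo": -1,    # Spanish
}

def lexicon_sentiment(text: str) -> str:
    """Zero-shot, language-agnostic sentiment via lexicon lookup only."""
    score = sum(lexicon.get(token.lower(), 0) for token in text.split())
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

print(lexicon_sentiment("filmnya bagus sekali"))  # -> positive
print(lexicon_sentiment("el servicio era malo"))  # -> negative
```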