AITopics | Sheng, Emily

Collaborating Authors

Sheng, Emily

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A Framework for Automated Measurement of Responsible AI Harms in Generative AI Applications

Magooda, Ahmed, Helyar, Alec, Jackson, Kyle, Sullivan, David, Atalla, Chad, Sheng, Emily, Vann, Dan, Edgar, Richard, Palangi, Hamid, Lutz, Roman, Kong, Hongliang, Yun, Vincent, Kamal, Eslam, Zarfati, Federico, Wallach, Hanna, Bird, Sarah, Chen, Mei

arXiv.org Artificial IntelligenceOct-26-2023

We present a framework for the automated measurement of responsible AI (RAI) metrics for large language models (LLMs) and associated products and services. Our framework for automatically measuring harms from LLMs builds on existing technical and sociotechnical expertise and leverages the capabilities of state-of-the-art LLMs, such as GPT-4. We use this framework to run through several case studies investigating how different LLMs may violate a range of RAI-related principles. The framework may be employed alongside domain-specific sociotechnical expertise to create measurements for new harm areas in the future. By implementing this framework, we aim to enable more advanced harm measurement efforts and further the responsible use of LLMs.

large language model, machine learning, natural language, (6 more...)

arXiv.org Artificial Intelligence

2310.1775

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback

The Woman Worked as a Babysitter: On Biases in Language Generation

Sheng, Emily, Chang, Kai-Wei, Natarajan, Premkumar, Peng, Nanyun

arXiv.org Artificial IntelligenceSep-3-2019

W e present a systematic study of biases in natural language generation (NLG) by analyzing text generated from prompts that contain mentions of different demographic groups. In this work, we introduce the notion of the regard towards a demographic, use the varying levels of regard towards different demographics as a defining metric for bias in NLG, and analyze the extent to which sentiment scores are a relevant proxy metric for regard. To this end, we collect strategically-generated text from language models and manually annotate the text with both sentiment and regard scores. Additionally, we build an automatic regard classifier through transfer learning, so that we can analyze biases in unseen text. Together, these methods reveal the extent of the biased nature of language model generations. Our analysis provides a study of biases in NLG, bias metrics and correlated human judgments, and empirical evidence on the usefulness of our annotated dataset.

deep learning, neural network, xyz, (19 more...)

arXiv.org Artificial Intelligence

1909.01326

Country:

North America > United States (0.46)
Europe (0.29)

Genre: Research Report (0.50)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)

Add feedback