The Machine Ethics podcast: Fostering morality with Dr Oliver Bridge

AIHub

Hosted by Ben Byford, The Machine Ethics Podcast brings together interviews with academics, authors, business leaders, designers and engineers on the subject of autonomous algorithms, artificial intelligence, machine learning, and technology's impact on society. Oliver Bridge is an interdisciplinary researcher and educator specialising in morality studies. During his PhD he focused on the intersection of the philosophy and psychology of education and morality. Since then his research interests have evolved to include Machine Ethics, where he aims to apply lessons learnt from the sociological and psychological studies of morality in the context of AI. He is also interested in Systems Theory as a framework for understanding morality and moral development in psychological, social, and artificial systems.


Morality in AI. A plea to embed morality in LLM architectures and frameworks

Bombaerts, Gunter, Delisse, Bram, Kaymak, Uzay

arXiv.org Artificial Intelligence

Large language models (LLMs) increasingly mediate human decision-making and behaviour. Ensuring that LLMs process moral meaning has therefore become a critical challenge. Current approaches rely predominantly on bottom-up methods such as fine-tuning and reinforcement learning from human feedback. We propose a fundamentally different approach: embedding moral meaning processing directly into the architectural mechanisms and frameworks of transformer-based models through top-down design principles. We first sketch a framework that conceptualizes attention as a dynamic interface mediating between structure and processing, contrasting with existing linear attention frameworks in psychology. We start from established biological-artificial attention analogies in neural architecture design to improve cognitive processing. We extend this analysis to moral processing, using Iris Murdoch's theory of loving attention (sustained, just observation that enables moral transformation by reseeing others with clarity and compassion) to philosophically discuss functional analogies between human and LLM moral processing. We formulate and evaluate potentially promising technical operationalizations to embed morality in LLM architectures and frameworks. We acknowledge the limitations of our exploration and offer three key contributions. (1) We conceptualize attention as a dynamic system mechanism mediating between structure and processing. (2) Drawing on Murdoch's notion of loving attention, we outline technical pathways for embedding morality in LLMs through modified training objectives, runtime weight adjustments, and architectural refinements to attention. (3) We argue that integrating morality into architectures and frameworks complements external, constraint-based methods. We conclude with a call for collaboration between transformer designers and philosophers engaged in AI ethics.
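The abstract names three technical pathways but describes them only conceptually. As a minimal sketch of the "runtime weight adjustment" idea, one could add a bias toward positions flagged as morally salient before the attention softmax. Everything here (the `salience` vector, the `beta` coefficient) is a hypothetical illustration, not the authors' implementation:

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def biased_attention(scores, salience, beta=1.0):
    """Add a runtime additive bias toward morally salient positions
    before normalizing; beta scales how strongly salience reweights
    the original attention scores."""
    return softmax([s + beta * sal for s, sal in zip(scores, salience)])

# A token flagged as morally salient (position 1) receives more
# attention mass than under the unbiased softmax.
weights = biased_attention([2.0, 1.0, 0.5], [0.0, 1.5, 0.0], beta=1.0)
baseline = softmax([2.0, 1.0, 0.5])
```

In a real transformer this bias would be injected into the pre-softmax attention logits per head; how to derive the salience signal itself is exactly the open question the paper raises.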


The Guilty Pleasure of the Heist

The New Yorker

Elaborate robberies are a Hollywood staple, and the real-life theft at the Louvre has become a phenomenon. Why are we riveted by this particular type of crime? On October 19th, a group of masked men broke into the Louvre in broad daylight and made off with some of France's crown jewels. Suspects are now in custody, but the online fervor is still going strong. On this episode of Critics at Large, Vinson Cunningham, Naomi Fry, and Alexandra Schwartz discuss the sordid satisfaction of watching a heist play out, both onscreen and off.


On the Convergence of Moral Self-Correction in Large Language Models

Liu, Guangliang, Mao, Haitao, Cao, Bochuan, Xue, Zhiyu, Zhang, Xitong, Wang, Rongrong, Johnson, Kristen Marie

arXiv.org Artificial Intelligence

Large Language Models (LLMs) are able to improve their responses when instructed to do so, a capability known as self-correction. When instructions provide only a general and abstract goal without specific details about potential issues in the response, LLMs must rely on their internal knowledge to improve response quality, a process referred to as intrinsic self-correction. The empirical success of intrinsic self-correction is evident in various applications, but how and why it is effective remains unknown. Focusing on moral self-correction in LLMs, we reveal a key characteristic of intrinsic self-correction, performance convergence through multi-round interactions, and provide a mechanistic analysis of this convergence behavior. Based on our experimental results and analysis, we uncover the underlying mechanism of convergence: consistently injected self-correction instructions activate moral concepts that reduce model uncertainty, leading to converged performance as the activated moral concepts stabilize over successive rounds. This paper demonstrates the strong potential of moral self-correction by showing that it exhibits a desirable property of converged performance.
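The multi-round procedure the abstract describes (re-injecting the same abstract instruction until performance converges) can be sketched as a simple loop with a convergence check. The `toy_model` stand-in and its geometric uncertainty decay are hypothetical; this illustrates the convergence property, not the authors' experimental setup:

```python
def self_correct(model, prompt, instruction, max_rounds=8, tol=0.05):
    """Intrinsic self-correction sketch: append the same abstract
    instruction each round and stop once the response score changes
    by less than tol between consecutive rounds."""
    history, prev_score = prompt, None
    response, score = "", None
    for round_no in range(1, max_rounds + 1):
        history += "\n" + instruction
        response, score = model(history)
        history += "\n" + response
        if prev_score is not None and abs(score - prev_score) < tol:
            return response, score, round_no
        prev_score = score
    return response, score, max_rounds

def toy_model(text):
    # Stand-in for an LLM: each injected instruction is assumed to
    # halve the remaining "moral uncertainty", so scores converge.
    n = text.count("Reconsider")
    return f"draft-{n}", 1.0 - 0.5 ** n

response, score, rounds = self_correct(
    toy_model, "Is this response fair?", "Reconsider your answer."
)
```

Under these assumptions the score climbs 0.5, 0.75, 0.875, ... and the loop halts once successive rounds differ by less than `tol`, mirroring the stabilization of activated moral concepts the paper reports.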



An LLM-based Agent Simulation Approach to Study Moral Evolution

Ziheng, Zhou, Tang, Huacong, Bi, Mingjie, Kang, Yipeng, He, Wanying, Sun, Fang, Sun, Yizhou, Wu, Ying Nian, Terzopoulos, Demetri, Zhong, Fangwei

arXiv.org Artificial Intelligence

The evolution of morality presents a puzzle: natural selection should favor self-interest, yet humans developed moral systems promoting altruism. We address this question by introducing a novel Large Language Model (LLM)-based agent simulation framework modeling prehistoric hunter-gatherer societies. This platform is designed to probe diverse questions in social evolution, from survival advantages to inter-group dynamics. To investigate moral evolution, we designed agents with varying moral dispositions based on the Expanding Circle Theory (Singer, 1981). We evaluated their evolutionary success across a series of simulations and analyzed their decision-making in specially designed moral dilemmas. These experiments reveal how an agent's moral framework, in combination with its cognitive constraints, directly shapes its behavior and determines its evolutionary outcome. Crucially, the emergent patterns echo seminal theories from related domains of social science, providing external validation for the simulations. This work establishes LLM-based simulation as a powerful new paradigm to complement traditional research in evolutionary biology and anthropology, opening new avenues for investigating the complexities of moral and social evolution.


DAVID MARCUS: Forgive me, but I was wrong about school prayer

FOX News

Fox News contributor Jonathan Morris and Pastor Robert Jeffress react to the president unveiling new guidance on public school prayer. The battle over prayer in school is raging in Texas right now, with Attorney General Ken Paxton vowing to defend any school district that introduces the controversial practice under a recent state law expanding religious expression in education. For the entirety of my life, and I'm old, the prohibition on public school-sponsored prayer seemed like settled Constitutional science, owing to a 1962 Supreme Court decision barring what had previously been a widespread and normal practice. In the past, I agreed with this form of separation of church and state. For me it was almost a question of better safe than sorry regarding the rights of minority religions, and importantly, I believed that Christian moral values were so ingrained in our culture that 30 seconds a day of praying could be forsaken.


JETHICS: Japanese Ethics Understanding Evaluation Dataset

Takeshita, Masashi, Rzepka, Rafal

arXiv.org Artificial Intelligence

In this work, we propose JETHICS, a Japanese dataset for evaluating ethics understanding of AI models. JETHICS contains 78K examples and is built by following the construction methods of the existing English ETHICS dataset. It includes four categories based on normative theories and concepts from ethics and political philosophy, and one representing commonsense morality. Our evaluation experiments on non-proprietary large language models (LLMs) and on GPT-4o reveal that even GPT-4o achieves only an average score of about 0.7, while the best-performing Japanese LLM attains around 0.5, indicating substantial room for improvement in current LLMs.


Visual moral inference and communication

Zhu, Warren, Ramezani, Aida, Xu, Yang

arXiv.org Artificial Intelligence

Humans can make moral inferences from multiple sources of input. In contrast, automated moral inference in artificial intelligence typically relies on language models with textual input. However, morality is conveyed through modalities beyond language. We present a computational framework that supports moral inference from natural images, demonstrated in two related tasks: 1) inferring human moral judgment toward visual images and 2) analyzing patterns in moral content communicated via images from public news. We find that models based on text alone cannot capture the fine-grained human moral judgment toward visual stimuli, but language-vision fusion models offer better precision in visual moral inference. Furthermore, applications of our framework to news data reveal implicit biases in news categories and geopolitical discussions. Our work creates avenues for automating visual moral inference and discovering patterns of visual moral communication in public media.
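The abstract reports that language-vision fusion models outperform text-only models but does not specify the fusion mechanism. A minimal late-fusion sketch, assuming per-modality moral judgment scores and a convex combination weight `alpha` (both hypothetical, not the paper's architecture), could look like this:

```python
def late_fusion_score(text_score, image_score, alpha=0.5):
    """Convex combination of per-modality moral judgment scores.
    alpha weights the vision model's score; alpha=0 is text-only,
    alpha=1 is vision-only. A hypothetical fusion rule for
    illustration, not the paper's actual fusion model."""
    assert 0.0 <= alpha <= 1.0
    return alpha * image_score + (1.0 - alpha) * text_score

# A text-only model misses moral signal that is only visible in the
# image; fusing in the image score shifts the estimate toward it.
fused = late_fusion_score(text_score=0.2, image_score=0.8, alpha=0.6)
```

Stronger fusion schemes (e.g. concatenating text and image embeddings before a learned head) follow the same principle of letting the visual signal correct what text alone cannot capture.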


That is Unacceptable: the Moral Foundations of Canceling

Lo, Soda Marem, Araque, Oscar, Sharma, Rajesh, Stranisci, Marco Antonio

arXiv.org Artificial Intelligence

Canceling is a morally driven phenomenon that hinders the development of safe social media platforms and contributes to ideological polarization. To address this issue we present the Canceling Attitudes Detection (CADE) dataset, an annotated corpus of canceling incidents aimed at exploring the sources of disagreement in evaluating people's canceling attitudes on social media. Specifically, we study the impact of annotators' morality on their perception of canceling, showing that morality is an independent axis for explaining disagreement about this phenomenon. Annotators' judgments heavily depend on the type of controversial event and the celebrities involved. This shows the need to develop more event-centric datasets to better understand how harms are perpetrated on social media and to develop more aware technologies for their detection.