Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?

Zhou, Jingyan, Hu, Minda, Li, Junan, Zhang, Xiaoying, Wu, Xixin, King, Irwin, Meng, Helen

Jul-1-2024–arXiv.org Artificial Intelligence

Making moral judgments is an essential step toward developing ethical AI systems. Prevalent approaches are mostly implemented in a bottom-up manner, which uses a large set of annotated data to train models based on crowd-sourced opinions about morality. These approaches have been criticized for overgeneralizing the moral stances of a limited group of annotators and lacking explainability. This work proposes a flexible top-down framework to steer (Large) Language Models (LMs) to perform moral reasoning with well-established moral theories from interdisciplinary research. The theory-guided top-down framework can incorporate various moral theories. Our experiments demonstrate the effectiveness of the proposed framework on datasets derived from moral theories. Furthermore, we show the alignment between different moral theories and existing morality datasets. Our analysis exhibits the potential and flaws in existing resources (models and datasets) in developing explainable moral judgment-making systems.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Jul-1-2024

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East
  - UAE (0.14)
- Europe > United Kingdom
  - England (0.14)
- North America > United States (0.46)

Genre:
- Research Report > New Finding (0.68)

Industry:
- Health & Medicine > Therapeutic Area (0.46)

Technology:
- Information Technology
  - Artificial Intelligence
    - Issues > Social & Ethical Issues (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.72)
    - Natural Language > Large Language Model (1.00)
  - Communications > Social Media (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found