PRISM: Perspective Reasoning for Integrated Synthesis and Mediation as a Multi-Perspective Framework for AI Alignment

Feb-4-2025–arXiv.org Artificial Intelligence

In this work, we propose Perspective Reasoning for Integrated Synthesis and Mediation (PRISM), a multiple-perspective framework for addressing persistent challenges in AI alignment such as conflicting human values and specification gaming. Grounded in cognitive science and moral psychology, PRISM organizes moral concerns into seven "basis worldviews", each hypothesized to capture a distinct dimension of human moral cognition, ranging from survival-focused reflexes through higher-order integrative perspectives. It then applies a Pareto-inspired optimization scheme to reconcile competing priorities without reducing them to a single metric. Under the assumption of reliable context validation for robust use, the framework follows a structured workflow that elicits viewpoint-specific responses, synthesizes them into a balanced outcome, and mediates remaining conflicts in a transparent and iterative manner. By referencing layered approaches to moral cognition from cognitive science, moral psychology, and neuroscience, PRISM clarifies how different moral drives interact and systematically documents and mediates ethical tradeoffs. We illustrate its efficacy through real outputs produced by a working prototype, applying PRISM to classic alignment problems in domains such as public health policy, workplace automation, and education. By anchoring AI deliberation in these human vantage points, PRISM aims to bound interpretive leaps that might otherwise drift into non-human or machine-centric territory. We briefly outline future directions, including real-world deployments and formal verifications, while maintaining the core focus on multi-perspective synthesis and conflict mediation.

collaboration, interpretability, reflex override, (17 more...)

arXiv.org Artificial Intelligence

Feb-4-2025

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - New York (0.04)
    - District of Columbia > Washington (0.04)
    - New Jersey
      - Mercer County > Princeton (0.04)
      - Hudson County > Hoboken (0.04)
    - Massachusetts
      - Middlesex County > Cambridge (0.04)
      - Essex County > Newburyport (0.04)
    - Indiana > St. Joseph County
      - Notre Dame (0.04)
    - Connecticut > New Haven County
      - New Haven (0.04)
    - California
      - San Francisco County > San Francisco (0.14)
      - Santa Clara County > Stanford (0.04)
      - San Diego County > San Diego (0.04)
      - Monterey County > Monterey (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - United Kingdom > England
    - Oxfordshire > Oxford (0.14)
    - Cambridgeshire > Cambridge (0.14)
    - Greater London > London (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)

Genre:
- Workflow (1.00)
- Research Report (1.00)

Industry:
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Government (1.00)
- Health & Medicine
  - Public Health (1.00)
  - Consumer Health (1.00)
  - Therapeutic Area
    - Psychiatry/Psychology > Mental Health (1.00)
    - Neurology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning (1.00)
  - Issues > Social & Ethical Issues (1.00)
  - Representation & Reasoning > Agents (0.93)
  - Cognitive Science > Cognitive Architectures (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found