AI Research Considerations for Human Existential Safety (ARCHES)
Critch, Andrew, Krueger, David
–arXiv.org Artificial Intelligence
Framed in positive terms, this report examines how technical AI research might be steered in a manner that is more attentive to humanity's long-term prospects for survival as a species. In negative terms, we ask what existential risks humanity might face from AI development in the next century, and by what principles contemporary technical research might be directed to address those risks. A key property of hypothetical AI technologies is introduced, called \emph{prepotence}, which is useful for delineating a variety of potential existential risks from artificial intelligence, even as AI paradigms might shift. A set of \auxref{dirtot} contemporary research \directions are then examined for their potential benefit to existential safety. Each research direction is explained with a scenario-driven motivation, and examples of existing work from which to build. The research directions present their own risks and benefits to society that could occur at various scales of impact, and in particular are not guaranteed to benefit existential safety if major developments in them are deployed without adequate forethought and oversight. As such, each direction is accompanied by a consideration of potentially negative side effects.
arXiv.org Artificial Intelligence
May-29-2020
- Country:
- North America
- United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California
- Los Angeles County > Los Angeles (0.13)
- Ventura County > Thousand Oaks (0.04)
- Santa Clara County > Palo Alto (0.04)
- Alameda County > Berkeley (0.04)
- Pennsylvania > Allegheny County
- Canada
- United States
- Europe
- Russia (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.14)
- Cambridgeshire > Cambridge (0.04)
- Ukraine > Kyiv Oblast
- Chernobyl (0.04)
- Estonia > Harju County
- Tallinn (0.04)
- Asia
- Russia (0.04)
- Middle East > Jordan (0.04)
- India > Madhya Pradesh
- Bhopal (0.04)
- North America
- Genre:
- Overview (1.00)
- Instructional Material (1.00)
- Research Report
- Experimental Study (0.92)
- New Finding (0.92)
- Industry:
- Law Enforcement & Public Safety (1.00)
- Information Technology > Security & Privacy (1.00)
- Transportation > Air (1.00)
- Automobiles & Trucks (0.92)
- Leisure & Entertainment > Games (0.67)
- Banking & Finance > Trading (0.67)
- Energy > Power Industry
- Education > Educational Setting
- Higher Education (0.67)
- Law
- Statutes (0.92)
- Business Law (0.67)
- Health & Medicine
- Therapeutic Area > Psychiatry/Psychology (1.00)
- Consumer Health (1.00)
- Government
- Technology:
- Information Technology
- Communications > Networks (1.00)
- Data Science > Data Mining (0.67)
- Artificial Intelligence
- Natural Language (1.00)
- Issues > Social & Ethical Issues (1.00)
- Applied AI (1.00)
- Robots > Autonomous Vehicles (0.93)
- Cognitive Science > Simulation of Human Behavior (0.92)
- Representation & Reasoning
- Agents (1.00)
- Uncertainty (0.67)
- Machine Learning
- Reinforcement Learning (1.00)
- Neural Networks > Deep Learning (0.92)
- Learning Graphical Models
- Undirected Networks > Markov Models (0.46)
- Directed Networks > Bayesian Learning (0.45)
- Information Technology