NeuroAI for AI Safety

Mineault, Patrick, Zanichelli, Niccolò, Peng, Joanne Zichen, Arkhipov, Anton, Bingham, Eli, Jara-Ettinger, Julian, Mackevicius, Emily, Marblestone, Adam, Mattar, Marcelo, Payne, Andrew, Sanborn, Sophia, Schroeder, Karen, Tavares, Zenna, Tolias, Andreas

Nov-27-2024–arXiv.org Artificial Intelligence

As AI systems become increasingly powerful, the need for safe AI has become more pressing. Humans are an attractive model for AI safety: as the only known agents capable of general intelligence, they perform robustly even under conditions that deviate significantly from prior experiences, explore the world safely, understand pragmatics, and can cooperate to meet their intrinsic goals. Intelligence, when coupled with cooperation and safety mechanisms, can drive sustained progress and well-being. These properties are a function of the architecture of the brain and the learning algorithms it implements. Neuroscience may thus hold important keys to technical AI safety that are currently underexplored and underutilized. In this roadmap, we highlight and critically evaluate several paths toward AI safety inspired by neuroscience: emulating the brain's representations, information processing, and architecture; building robust sensory and motor systems from imitating brain data and bodies; fine-tuning AI systems on brain data; advancing interpretability using neuroscience methods; and scaling up cognitively-inspired architectures. We make several concrete recommendations for how neuroscience can positively impact AI safety.

large language model, pattern recognition, simulation of human behavior, (27 more...)

arXiv.org Artificial Intelligence

Nov-27-2024

arXiv.org PDF

Add feedback

Country:
- Africa > Senegal
  - Kolda Region > Kolda (0.04)
- Asia
  - China > Xinjiang Uygur Autonomous Region (0.04)
  - Middle East > Jordan (0.13)
- Europe
  - Ireland (0.04)
  - Spain > Aragón (0.04)
  - Switzerland > Basel-City
    - Basel (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.14)
    - Greater London > London (0.04)
    - Oxfordshire > Oxford (0.14)
- North America > United States
  - California
    - San Francisco County > San Francisco (0.04)
    - Santa Clara County > Palo Alto (0.04)
  - Connecticut > New Haven County
    - New Haven (0.04)
  - District of Columbia > Washington (0.04)
  - Gulf of Mexico > Central GOM (0.04)
  - Iowa (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - New York > New York County
    - New York City (0.04)
  - Utah (0.04)
- Pacific Ocean > North Pacific Ocean
  - San Francisco Bay > Golden Gate (0.04)
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.92)
  - Promising Solution (0.67)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science
    - Neuroscience (1.00)
    - Problem Solving (1.00)
    - Simulation of Human Behavior (0.67)
  - Issues > Social & Ethical Issues (1.00)
  - Machine Learning
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (1.00)
    - Neural Networks > Deep Learning (1.00)
    - Pattern Recognition (0.67)
    - Reinforcement Learning (1.00)
    - Statistical Learning (1.00)
  - Natural Language
    - Chatbot (0.67)
    - Large Language Model (1.00)
  - Representation & Reasoning
    - Agents (1.00)
    - Uncertainty > Bayesian Inference (1.00)
  - Robots > Autonomous Vehicles (0.92)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found