Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice

Cooper, A. Feder, Choquette-Choo, Christopher A., Bogen, Miranda, Jagielski, Matthew, Filippova, Katja, Liu, Ken Ziyu, Chouldechova, Alexandra, Hayes, Jamie, Huang, Yangsibo, Mireshghallah, Niloofar, Shumailov, Ilia, Triantafillou, Eleni, Kairouz, Peter, Mitchell, Nicole, Liang, Percy, Ho, Daniel E., Choi, Yejin, Koyejo, Sanmi, Delgado, Fernando, Grimmelmann, James, Shmatikov, Vitaly, De Sa, Christopher, Barocas, Solon, Cyphert, Amy, Lemley, Mark, boyd, danah, Vaughan, Jennifer Wortman, Brundage, Miles, Bau, David, Neel, Seth, Jacobs, Abigail Z., Terzis, Andreas, Wallach, Hanna, Papernot, Nicolas, Lee, Katherine

Dec-9-2024–arXiv.org Artificial Intelligence

We articulate fundamental mismatches between technical methods for machine unlearning in Generative AI, and documented aspirations for broader impact that these methods could have for law and policy. These aspirations are both numerous and varied, motivated by issues that pertain to privacy, copyright, safety, and more. For example, unlearning is often invoked as a solution for removing the effects of targeted information from a generative-AI model's parameters, e.g., a particular individual's personal data or in-copyright expression of Spiderman that was included in the model's training data. Unlearning is also proposed as a way to prevent a model from generating targeted types of information in its outputs, e.g., generations that closely resemble a particular individual's data or reflect the concept of "Spiderman." Both of these goals--the targeted removal of information from a model and the targeted suppression of information from a model's outputs--present various technical and substantive challenges. We provide a framework for thinking rigorously about these challenges, which enables us to be clear about why unlearning is not a general-purpose solution for circumscribing generative-AI model behavior in service of broader positive impact. We aim for conceptual clarity and to encourage more thoughtful communication among machine learning (ML), law, and policy experts who seek to develop and apply technical methods for compliance with policy objectives.

information, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Dec-9-2024

arXiv.org PDF

Add feedback

Country:
- South America > Colombia
  - Meta Department > Villavicencio (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - Virginia (0.04)
    - Florida > Orange County (0.04)
    - West Virginia (0.04)
    - Texas (0.04)
    - North Carolina (0.04)
    - Michigan (0.04)
    - Iowa (0.04)
    - Colorado (0.04)
    - Maryland > Montgomery County
      - Gaithersburg (0.04)
    - California > Santa Clara County
      - Palo Alto (0.04)
    - New York > New York County
      - New York City (0.04)
- Europe
  - United Kingdom > England
    - Buckinghamshire > Milton Keynes (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
- Asia
  - Indonesia > Bali (0.04)
  - South Korea > Seoul
    - Seoul (0.04)
- Africa > Eswatini
  - Manzini > Manzini (0.04)

Genre:
- Research Report > Promising Solution (0.48)

Industry:
- Information Technology > Security & Privacy (1.00)
- Health & Medicine > Therapeutic Area (0.93)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.92)
- Law
  - Statutes (1.00)
  - Intellectual Property & Technology Law (1.00)
  - Civil Rights & Constitutional Law (0.93)
- Government > Regional Government
  - North America Government > United States Government (1.00)
  - Europe Government (0.67)
- Education > Curriculum
  - Subject-Specific Education (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning > Generative AI (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found