A Hierarchy of Limitations in Machine Learning

Feb-29-2020–arXiv.org Machine Learning

There is little argument about whether machine learning models are useful when applied to social systems. But if we take seriously George Box's dictum, or indeed the even older one that "the map is not the territory" (Korzybski, 1933), then there has been comparatively less systematic attention paid within the field to how machine learning models are wrong (Selbst et al., 2019) and to seeing possible harms in that light. By "wrong" I do not mean making misclassifications, or even fitting over the "wrong" class of functions, but rather more fundamental mathematical/statistical assumptions, philosophical (in the sense used by Abbott, 1988) commitments about how we represent the world, and sociological processes of how models interact with target phenomena. This paper takes a particular model of machine learning research or application: one that its creators and deployers think provides a reliable way of interacting with the social world (whether through understanding or through making predictions), without any intent to cause harm (McQuillan, 2018) and, in fact, with a desire to avoid harm and instead improve the world. This is most explicit in the various "{Data [Science], Machine Learning, Artificial Intelligence} for [Social] Good" initiatives, and appears more widely in framings around "fairness" or "ethics." I focus on the almost entirely statistical modern version of machine learning, rather than eclipsed older visions (see section 3). While many of the limitations I discuss apply to the use of machine learning in any domain, I focus on applications to the social world in order to explore the domain where limitations are strongest and stickiest.

artificial intelligence, machine learning, prediction

Country:
- Asia
  - Kazakhstan > West Kazakhstan Region (0.04)
  - Middle East > Republic of Türkiye
    - Karaman Province > Karaman (0.04)
  - Vietnam (0.04)
- Europe
  - Germany (0.04)
  - Spain (0.04)
  - United Kingdom
    - England
      - Cambridgeshire > Cambridge (0.04)
      - Oxfordshire > Oxford (0.04)
    - Scotland > Lanarkshire (0.04)
- North America
  - Canada > Ontario (0.04)
  - United States
    - Pennsylvania > Allegheny County
      - Pittsburgh (0.04)
    - Colorado > Boulder County
      - Boulder (0.04)
    - California (0.14)
    - Massachusetts (0.04)
    - Virginia (0.04)
    - Wisconsin > Dane County
      - Madison (0.04)
    - Michigan (0.04)
    - Illinois > Cook County
      - Chicago (0.04)
    - Hawaii (0.04)
    - Oregon (0.04)
    - New York (0.04)
    - Minnesota (0.04)
    - South Carolina (0.04)
- Oceania > Australia (0.14)

Genre:
- Overview (1.00)
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.92)

Industry:
- Transportation
  - Ground > Road (0.67)
  - Passenger (0.67)
- Education (1.00)
- Government
  - Military (0.92)
  - Regional Government > North America Government
    - United States Government (1.00)
- Health & Medicine
  - Epidemiology (0.67)
  - Pharmaceuticals & Biotechnology (0.67)
  - Therapeutic Area
    - Immunology (0.67)
    - Infections and Infectious Diseases (0.93)
    - Oncology (1.00)
    - Psychiatry/Psychology (0.92)
- Law
  - Civil Rights & Constitutional Law (1.00)
  - Criminal Law (0.67)
- Banking & Finance > Insurance (1.00)
- Information Technology (1.00)
- Consumer Products & Services (0.92)
- Social Sector (0.67)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Decision Tree Learning (0.67)
  - Neural Networks (0.67)
  - Performance Analysis > Accuracy (0.92)
  - Statistical Learning (1.00)
