A Hierarchy of Limitations in Machine Learning
There is little argument about whether or not machine learning models are useful for applying to social systems. But if we take seriously George Box's dictum, or indeed the even older one that "the map is not the territory' (Korzybski, 1933), then there has been comparatively less systematic attention paid within the field to how machine learning models are wrong (Selbst et al., 2019) and seeing possible harms in that light. By "wrong" I do not mean in terms of making misclassifications, or even fitting over the'wrong' class of functions, but more fundamental mathematical/statistical assumptions, philosophical (in the sense used by Abbott, 1988) commitments about how we represent the world, and sociological processes of how models interact with target phenomena. This paper takes a particular model of machine learning research or application: one that its creators and deployers think provides a reliable way of interacting with the social world (whether that is through understanding, or in making predictions) without any intent to cause harm (McQuillan, 2018) and, in fact, a desire to not cause harm and instead improve the world, 1 for example as most explicitly in the various "{Data [Science], Machine Learning, Artificial Intelligence} for [Social] Good" initiatives, and more widely in framings around "fairness" or "ethics." I focus on the almost entirely statistical modern version of machine learning, rather than eclipsed older visions (see section 3). While many of the limitations I discuss apply to the use of machine learning in any domain, I focus on applications to the social world in order to explore the domain where limitations are strongest and stickiest.
Feb-29-2020
- Country:
- Asia
- Kazakhstan > West Kazakhstan Region (0.04)
- Middle East > Republic of Türkiye
- Karaman Province > Karaman (0.04)
- Vietnam (0.04)
- Europe
- Germany (0.04)
- Spain (0.04)
- United Kingdom
- England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- Scotland > Lanarkshire (0.04)
- England
- North America
- Canada > Ontario (0.04)
- United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Colorado > Boulder County
- Boulder (0.04)
- California (0.14)
- Massachusetts (0.04)
- Virginia (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Michigan (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Hawaii (0.04)
- Oregon (0.04)
- New York (0.04)
- Minnesota (0.04)
- South Carolina (0.04)
- Pennsylvania > Allegheny County
- Oceania > Australia (0.14)
- Asia
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (1.00)
- New Finding (0.92)
- Industry:
- Transportation
- Education (1.00)
- Government
- Health & Medicine
- Epidemiology (0.67)
- Pharmaceuticals & Biotechnology (0.67)
- Therapeutic Area
- Immunology (0.67)
- Infections and Infectious Diseases (0.93)
- Oncology (1.00)
- Psychiatry/Psychology (0.92)
- Law
- Civil Rights & Constitutional Law (1.00)
- Criminal Law (0.67)
- Banking & Finance > Insurance (1.00)
- Information Technology (1.00)
- Consumer Products & Services (0.92)
- Social Sector (0.67)
- Technology:
- Information Technology > Artificial Intelligence > Machine Learning
- Decision Tree Learning (0.67)
- Neural Networks (0.67)
- Performance Analysis > Accuracy (0.92)
- Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning