A Hierarchy of Limitations in Machine Learning
There is little argument about whether or not machine learning models are useful for applying to social systems. But if we take seriously George Box's dictum, or indeed the even older one that "the map is not the territory' (Korzybski, 1933), then there has been comparatively less systematic attention paid within the field to how machine learning models are wrong (Selbst et al., 2019) and seeing possible harms in that light. By "wrong" I do not mean in terms of making misclassifications, or even fitting over the'wrong' class of functions, but more fundamental mathematical/statistical assumptions, philosophical (in the sense used by Abbott, 1988) commitments about how we represent the world, and sociological processes of how models interact with target phenomena. This paper takes a particular model of machine learning research or application: one that its creators and deployers think provides a reliable way of interacting with the social world (whether that is through understanding, or in making predictions) without any intent to cause harm (McQuillan, 2018) and, in fact, a desire to not cause harm and instead improve the world, 1 for example as most explicitly in the various "{Data [Science], Machine Learning, Artificial Intelligence} for [Social] Good" initiatives, and more widely in framings around "fairness" or "ethics." I focus on the almost entirely statistical modern version of machine learning, rather than eclipsed older visions (see section 3). While many of the limitations I discuss apply to the use of machine learning in any domain, I focus on applications to the social world in order to explore the domain where limitations are strongest and stickiest.
Feb-29-2020
- Country:
- Oceania > Australia (0.14)
- North America
- Canada > Ontario (0.04)
- United States
- California (0.14)
- Virginia (0.04)
- Massachusetts (0.04)
- New York (0.04)
- Hawaii (0.04)
- South Carolina (0.04)
- Minnesota (0.04)
- Oregon (0.04)
- Michigan (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Colorado > Boulder County
- Boulder (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Europe
- Spain (0.04)
- Germany (0.04)
- United Kingdom
- Scotland > Lanarkshire (0.04)
- England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- Asia
- Vietnam (0.04)
- Middle East > Republic of Türkiye
- Karaman Province > Karaman (0.04)
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (1.00)
- New Finding (0.92)
- Industry:
- Information Technology (1.00)
- Banking & Finance > Insurance (1.00)
- Education (1.00)
- Consumer Products & Services (0.92)
- Social Sector (0.67)
- Law
- Civil Rights & Constitutional Law (1.00)
- Criminal Law (0.67)
- Health & Medicine
- Epidemiology (0.67)
- Pharmaceuticals & Biotechnology (0.67)
- Therapeutic Area
- Oncology (1.00)
- Infections and Infectious Diseases (0.93)
- Psychiatry/Psychology (0.92)
- Immunology (0.67)
- Government
- Transportation
- Technology:
- Information Technology > Artificial Intelligence > Machine Learning
- Statistical Learning (1.00)
- Performance Analysis > Accuracy (0.92)
- Decision Tree Learning (0.67)
- Neural Networks (0.67)
- Information Technology > Artificial Intelligence > Machine Learning