Generalizing in the Real World with Representation Learning

Oct-18-2022–arXiv.org Artificial Intelligence

Machine learning (ML) formalizes the problem of getting computers to learn from experience as optimization of performance according to some metric(s) on a set of data examples. This is in contrast to requiring behaviour specified in advance (e.g. by hard-coded rules). Formalization of this problem has enabled great progress in many applications with large real-world impact, including translation, speech recognition, self-driving cars, and drug discovery. But practical instantiations of this formalism make many assumptions - for example, that data are i.i.d.: independent and identically distributed - whose soundness is seldom investigated. And in making great progress in such a short time, the field has developed many norms and ad-hoc standards, focused on a relatively small range of problem settings. As applications of ML, particularly in artificial intelligence (AI) systems, become more pervasive in the real world, we need to critically examine these assumptions, norms, and problem settings, as well as the methods that have become de-facto standards. There is much we still do not understand about how and why deep networks trained with stochastic gradient descent are able to generalize as well as they do, why they fail when they do, and how they will perform on out-of-distribution data. In this thesis I cover some of my work towards better understanding deep net generalization, identify several ways assumptions and problem settings fail to generalize to the real world, and propose ways to address those failures in practice.

artificial intelligence, bayesian inference, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Oct-18-2022

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Washington (0.04)
    - New York > New York County
      - New York City (0.04)
    - New Jersey > Hudson County
      - Secaucus (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.13)
    - California
      - San Francisco County > San Francisco (0.13)
      - Santa Clara County > Palo Alto (0.04)
  - Canada
    - Ontario > Toronto (0.13)
    - Quebec > Montreal (0.04)
- Europe
  - Italy > Sardinia (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Germany > North Rhine-Westphalia
    - Upper Bavaria > Munich (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Overview (1.00)
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Leisure & Entertainment (1.00)
- Information Technology > Security & Privacy (1.00)
- Education (1.00)
- Media > Film (0.92)
- Law (0.92)
- Health & Medicine
  - Pharmaceuticals & Biotechnology (1.00)
  - Epidemiology (1.00)
  - Therapeutic Area
    - Infections and Infectious Diseases (1.00)
    - Immunology (1.00)
    - Pulmonary/Respiratory Diseases (0.92)
- Government > Regional Government
  - North America Government > United States Government (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science (1.00)
  - Representation & Reasoning
    - Agents (1.00)
    - Rule-Based Reasoning (0.92)
    - Uncertainty > Bayesian Inference (0.67)
  - Machine Learning
    - Statistical Learning (1.00)
    - Performance Analysis > Accuracy (1.00)
    - Neural Networks > Deep Learning (1.00)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found