A high-bias, low-variance introduction to Machine Learning for physicists
Mehta, Pankaj, Bukov, Marin, Wang, Ching-Hao, Day, Alexandre G. R., Richardson, Clint, Fisher, Charles K., Schwab, David J.
Machine Learning (ML) is one of the most exciting and dynamic areas of modern research and application. The purpose of this review is to provide an introduction to the core concepts and tools of machine learning in a manner easily understood and intuitive to physicists. The review begins by covering fundamental concepts in ML and modern statistics such as the bias-variance tradeoff, overfitting, regularization, and generalization before moving on to more advanced topics in both supervised and unsupervised learning. Topics covered in the review include ensemble models, deep learning and neural networks, clustering and data visualization, energy-based models (including MaxEnt models and Restricted Boltzmann Machines), and variational methods. Throughout, we emphasize the many natural connections between ML and statistical physics. A notable aspect of the review is the use of Python notebooks to introduce modern ML/statistical packages to readers using physics-inspired datasets (the Ising Model and Monte-Carlo simulations of supersymmetric decays of proton-proton collisions). We conclude with an extended outlook discussing possible uses of machine learning for furthering our understanding of the physical world as well as open problems in ML where physicists maybe able to contribute. (Notebooks are available at https://physics.bu.edu/~pankajm/MLnotebooks.html )
Mar-23-2018
- Country:
- North America > United States
- California
- Alameda County > Berkeley (0.13)
- San Francisco County > San Francisco (0.13)
- New York > New York County
- New York City (0.13)
- California
- North America > United States
- Genre:
- Instructional Material > Course Syllabus & Notes (1.00)
- Overview (1.00)
- Research Report (1.00)
- Summary/Review (1.00)
- Workflow (1.00)
- Industry:
- Education
- Curriculum (0.67)
- Educational Setting (0.92)
- Energy > Oil & Gas
- Upstream (0.45)
- Health & Medicine
- Pharmaceuticals & Biotechnology (0.67)
- Therapeutic Area > Neurology (0.45)
- Information Technology (0.67)
- Education
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Ensemble Learning (1.00)
- Learning Graphical Models
- Directed Networks > Bayesian Learning (1.00)
- Undirected Networks > Markov Models (1.00)
- Neural Networks > Deep Learning (1.00)
- Performance Analysis > Accuracy (0.93)
- Statistical Learning > Clustering (1.00)
- Representation & Reasoning
- Optimization (1.00)
- Uncertainty > Bayesian Inference (1.00)
- Machine Learning
- Information Technology > Artificial Intelligence