Trustworthy Machine Learning
Mucsányi, Bálint, Kirchhof, Michael, Nguyen, Elisa, Rubinstein, Alexander, Oh, Seong Joon
–arXiv.org Artificial Intelligence
As machine learning technology gets applied to actual products and solutions, new challenges have emerged. Models unexpectedly fail to generalize to small changes in the distribution, tend to be confident on novel data they have never seen, or cannot communicate the rationale behind their decisions effectively with the end users. Collectively, we face a trustworthiness issue with the current machine learning technology. This textbook on Trustworthy Machine Learning (TML) covers a theoretical and technical background of four key topics in TML: Out-of-Distribution Generalization, Explainability, Uncertainty Quantification, and Evaluation of Trustworthiness. We discuss important classical and contemporary research papers of the aforementioned fields and uncover and connect their underlying intuitions. The book evolved from the homonymous course at the University of T\"ubingen, first offered in the Winter Semester of 2022/23. It is meant to be a stand-alone product accompanied by code snippets and various pointers to further sources on topics of TML. The dedicated website of the book is https://trustworthyml.io/.
arXiv.org Artificial Intelligence
Oct-12-2023
- Country:
- Europe
- Germany > Baden-Württemberg
- Tübingen Region > Tübingen (0.13)
- Switzerland > Zürich
- Zürich (0.13)
- Germany > Baden-Württemberg
- North America > United States
- Louisiana (0.13)
- New Mexico > Lea County (0.13)
- Europe
- Genre:
- Instructional Material (1.00)
- Overview (1.00)
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Promising Solution (0.92)
- Summary/Review (1.00)
- Industry:
- Banking & Finance > Economy (0.67)
- Energy > Oil & Gas
- Upstream (0.45)
- Government (1.00)
- Health & Medicine > Therapeutic Area (1.00)
- Information Technology
- Security & Privacy (1.00)
- Services (0.92)
- Law (0.67)
- Leisure & Entertainment
- Transportation
- Technology:
- Information Technology > Artificial Intelligence
- Issues > Social & Ethical Issues (0.67)
- Machine Learning
- Learning Graphical Models > Directed Networks
- Bayesian Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Pattern Recognition (0.67)
- Performance Analysis > Accuracy (1.00)
- Statistical Learning (1.00)
- Learning Graphical Models > Directed Networks
- Natural Language
- Chatbot (0.67)
- Explanation & Argumentation (1.00)
- Large Language Model (1.00)
- Text Processing (1.00)
- Representation & Reasoning
- Mathematical & Statistical Methods (0.67)
- Optimization (1.00)
- Uncertainty > Bayesian Inference (1.00)
- Vision
- Face Recognition (0.92)
- Image Understanding (0.67)
- Information Technology > Artificial Intelligence