Ensuring the Robustness and Reliability of Data-Driven Knowledge Discovery Models in Production and Manufacturing
Tripathi, Shailesh, Muhr, David, Brunner, Manuel, Emmert-Streib, Frank, Jodlbauer, Herbert, Dehmer, Matthias
arXiv.org Artificial Intelligence
Implementing robust, stable, and user-centered data analytics and machine learning models in production and manufacturing faces numerous challenges. A systematic approach is therefore required to develop, evaluate, and deploy such models. The data-driven knowledge discovery framework partitions the data-mining process into ordered phases to support the practical implementation of data analytics and machine learning models. However, the practical application of robust, industry-specific data-driven knowledge discovery models faces multiple data- and model-development-related issues. These issues should be addressed carefully by allowing a flexible, customized, and industry-specific knowledge discovery framework; in our case, this takes the form of the cross-industry standard process for data mining (CRISP-DM). This framework is designed to ensure active cooperation between its phases so that data- and model-related issues are adequately addressed. In this paper, we review several extensions of CRISP-DM and various data-robustness- and model-robustness-related problems in machine learning, which currently lacks proper cooperation between data experts and business experts because of the limitations of data-driven knowledge discovery models.
Jul-28-2020
- Country:
- Europe (1.00)
- North America > United States (0.93)
- Genre:
- Overview (1.00)
- Research Report (1.00)
- Industry:
- Energy > Oil & Gas (0.46)
- Health & Medicine > Therapeutic Area (0.46)
- Information Technology (1.00)
- Materials > Metals & Mining (0.48)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.46)
- Neural Networks > Deep Learning (0.68)
- Performance Analysis > Accuracy (0.46)
- Statistical Learning > Regression (0.67)
- Representation & Reasoning
- Agents (1.00)
- Uncertainty (1.00)
- Data Science > Data Mining
- Knowledge Discovery (1.00)