leak probability
Machine Learning in High Volume Media Manufacturing
Karuka, Siddarth Reddy, Sunderrajan, Abhinav, Zheng, Zheng, Tiean, Yong Woon, Nagappan, Ganesh, Luk, Allan
Errors or failures in a high-volume manufacturing environment can have significant impact that can result in both the loss of time and money. Identifying such failures early has been a top priority for manufacturing industries and various rule-based algorithms have been developed over the years. However, catching these failures is time consuming and such algorithms cannot adapt well to changes in designs, and sometimes variations in everyday behavior. More importantly, the number of units to monitor in a high-volume manufacturing environment is too big for manual monitoring or for a simple program. Here we develop a novel program that combines both rule-based decisions and machine learning models that can not only learn and adapt to such day-to-day variations or long-term design changes, but also can be applied at scale to the high number of manufacturing units in use today. Using the current state-of-the-art technologies, we then deploy this program at-scale to handle the needs of ever-increasing demand from the manufacturing environment.
- Asia > Singapore (0.04)
- North America > United States > Oregon > Multnomah County > Portland (0.04)
- North America > United States > Minnesota > Hennepin County > Bloomington (0.04)
- North America > United States > Colorado > Boulder County > Longmont (0.04)
- Workflow (0.48)
- Research Report (0.40)
- Media (0.40)
- Information Technology (0.31)
A Latent Variable Model for Discovering Bird Species Commonly Misidentified by Citizen Scientists
Yu, Jun (Oregon State University) | Hutchinson, Rebecca A. (Oregon State University) | Wong, Weng-Keen (Oregon State University)
Data quality is a common source of concern for large-scale citizen science projects like eBird. In the case of eBird, a major cause of poor quality data is the misidentification of bird species by inexperienced contributors. A proactive approach for improving data quality is to discover commonly misidentified bird species and to teach inexperienced birders the differences between these species. To accomplish this goal, we develop a latent variable graphical model that can identify groups of bird species that are often confused for each other by eBird participants. Our model is a multi-species extension of the classic occupancy-detection model in the ecology literature. This multi-species extension requires a structure learning step as well as a computationally expensive parameter learning stage which we make efficient through a variational approximation. We show that our model can not only discover groups of misidentified species, but by including these misidentifications in the model, it can also achieve more accurate predictions of both species occupancy and detection.
- North America > United States > California (0.05)
- Asia > Middle East > Jordan (0.04)
- North America > United States > Oregon (0.04)
- (2 more...)
Why Is Diagnosis Using Belief Networks Insensitive to Imprecision In Probabilities?
Henrion, Max, Pradhan, Malcolm, del Favero, Brendan, Huang, Kurt, Provan, Gregory M., O'Rorke, Paul
Recent research has found that diagnostic performance with Bayesian belief networks is often surprisingly insensitive to imprecision in the numerical probabilities. For example, the authors have recently completed an extensive study in which they applied random noise to the numerical probabilities in a set of belief networks for medical diagnosis, subsets of the CPCS network, a subset of the QMR (Quick Medical Reference) focused on liver and bile diseases. The diagnostic performance in terms of the average probabilities assigned to the actual diseases showed small sensitivity even to large amounts of noise. In this paper, we summarize the findings of this study and discuss possible explanations of this low sensitivity. One reason is that the criterion for performance is average probability of the true hypotheses, rather than average error in probability, which is insensitive to symmetric noise distributions. But, we show that even asymmetric, logodds-normal noise has modest effects. A second reason is that the gold-standard posterior probabilities are often near zero or one, and are little disturbed by noise.
- North America > United States > District of Columbia > Washington (0.14)
- North America > United States > New York (0.04)
- North America > United States > California > Santa Clara County > Stanford (0.04)
- (8 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)