Diagnosis
Application of machine learning for hematological diagnosis
Gunčar, Gregor, Kukar, Matjaž, Notar, Mateja, Brvar, Miran, Černelč, Peter, Notar, Manca, Notar, Marko
Quick and accurate medical diagnosis is crucial for the successful treatment of a disease. Using machine learning algorithms, we have built two models to predict a hematologic disease, based on laboratory blood test results. In one predictive model, we used all available blood test parameters and in the other a reduced set, which is usually measured upon patient admittance. Both models produced good results, with a prediction accuracy of 0.88 and 0.86, when considering the list of five most probable diseases, and 0.59 and 0.57, when considering only the most probable disease. Models did not differ significantly from each other, which indicates that a reduced set of parameters contains a relevant fingerprint of a disease, expanding the utility of the model for general practitioner's use and indicating that there is more information in the blood test results than physicians recognize. In the clinical test we showed that the accuracy of our predictive models was on a par with the ability of hematology specialists. Our study is the first to show that a machine learning predictive model based on blood tests alone, can be successfully applied to predict hematologic diseases and could open up unprecedented possibilities in medical diagnosis.
How machine learning and financial technology are transforming the lending sector
The lending ecosystem around the world has been at the centre of significant changes in the last decade. From financial technology disrupting the financial services sector industry with highly efficient and cost-effective processes, to stringent regulations following the 2008 global financial crisis, the growing technological intervention has played a significant role in the rapid evolution of the lending industry. One such technology is machine learning which has begun to create new and highly promising avenues in the lending market. Machine learning is a Predictive Model Algorithm that develops Artificial Intelligence around large sets of data through different predictive statistical techniques (such as Logistic Regression, Random Forest, Decision Tree etc.) and imparts decisions/insights based on the data it processes. Machines can be taught to identify any form of data which is stored electronically such as texts, images, speech, etc. and analyse by the machine through such algorithms to identify behaviours, patterns etc. and generate similar predictions when imposed on a new dataset. Fintech companies are increasingly augmenting the applications of machine learning algorithms in their operations to build efficient and effective systems.
Causes for Query Answers from Databases: Datalog Abduction, View-Updates, and Integrity Constraints
Bertossi, Leopoldo, Salimi, Babak
Causality has been recently introduced in databases, to model, characterize, and possibly compute causes for query answers. Connections between QA-causality and consistency-based diagnosis and database repairs (wrt. integrity constraint violations) have already been established. In this work we establish precise connections between QA-causality and both abductive diagnosis and the view-update problem in databases, allowing us to obtain new algorithmic and complexity results for QA-causality. We also obtain new results on the complexity of view-conditioned causality, and investigate the notion of QA-causality in the presence of integrity constraints, obtaining complexity results from a connection with view-conditioned causality. The abduction connection under integrity constraints allows us to obtain algorithmic tools for QA-causality.
Discover structure behind data with decision trees - Vooban
Let's understand and model the hidden structure behind data with Decision Trees. In this tutorial, we'll explore and inspect how a model can do its decisions on a car evaluation data set. Decision trees work with simple "if" clauses dichotomically chained together, splitting the data flow recursively on those "if"s until they reach a leaf where we can categorize the data. Such data inspection could be used to reverse engineer the behavior of any function. Since decision trees are good algorithms for discovering the structure hidden behind data, we'll use and model the car evaluation data set, for which the prediction problem is a (deterministic) surjective function.
Displayr Machine Learning: Pruning Decision Trees
Machine learning is a problem of trade-offs. The classic issue is overfitting versus underfitting. Overfitting happens when a model memorizes its training data so well that it is learning noise on top of the signal. Underfitting is the opposite: the model is too simple to find the patterns in the data. Simplicity versus accuracy is a similar consideration. Do you want a model that can fit onto one sheet of paper and be understood by a broad audience?
Two-Class Boosted Decision Tree
Two-Class Boosted Decision Tree module creates a machine learning model that is based on the boosted decision trees algorithm. A boosted decision tree is an ensemble learning method in which the second tree corrects for the errors of the first tree, the third tree corrects for the errors of the first and second trees, and so forth. Predictions are based on the entire ensemble of trees together that makes the prediction. Step 1 Add the Boosted Decision Tree module to the experiment. Step 2 Specify how you want the model to be trained, by setting the Create trainer mode option.
Survey on Models and Techniques for Root-Cause Analysis
Solé, Marc, Muntés-Mulero, Victor, Rana, Annie Ibrahim, Estrada, Giovani
Automation and computer intelligence to support complex human decisions becomes essential to manage large and distributed systems in the Cloud and IoT era. Understanding the root cause of an observed symptom in a complex system has been a major problem for decades. As industry dives into the IoT world and the amount of data generated per year grows at an amazing speed, an important question is how to find appropriate mechanisms to determine root causes that can handle huge amounts of data or may provide valuable feedback in real-time. While many survey papers aim at summarizing the landscape of techniques for modelling system behavior and infering the root cause of a problem based in the resulting models, none of those focuses on analyzing how the different techniques in the literature fit growing requirements in terms of performance and scalability. In this survey, we provide a review of root-cause analysis, focusing on these particular aspects. We also provide guidance to choose the best root-cause analysis strategy depending on the requirements of a particular system and application.
WatsonPaths: Scenario-Based Question Answering and Inference over Unstructured Information
Lally, Adam (Information Technology and Services) | Bagchi, Sugato (IBM Research) | Barborak, Michael A. (IBM T. J. Watson Research Center) | Buchanan, David W. (IBM T. J. Watson Research Center) | Chu-Carroll, Jennifer (IBM Research) | Ferrucci, David A. (Bridgewater) | Glass, Michael R. (IBM Research) | Kalyanpur, Aditya (IBM T. J. Watson Research Center) | Mueller, Erik T. (Capital One) | Murdock, J. William (IBM T. J. Watson Research Center) | Patwardhan, Siddharth (IBM T. J. Watson Research Center) | Prager, John M. (IBM T. J. Watson Research Center)
We present WatsonPaths, a novel system that can answer scenario-based questions. These include medical questions that present a patient summary and ask for the most likely diagnosis or most appropriate treatment. WatsonPaths builds on the IBM Watson question answering system. WatsonPaths breaks down the input scenario into individual pieces of information, asks relevant subquestions of Watson to conclude new information, and represents these results in a graphical model. Probabilistic inference is performed over the graph to conclude the answer. On a set of medical test preparation questions, WatsonPaths shows a significant improvement in accuracy over multiple baselines.
Probabilistic Active Learning of Functions in Structural Causal Models
Rubenstein, Paul K., Tolstikhin, Ilya, Hennig, Philipp, Schoelkopf, Bernhard
We consider the problem of learning the functions computing children from parents in a Structural Causal Model once the underlying causal graph has been identified. This is in some sense the second step after causal discovery. Taking a probabilistic approach to estimating these functions, we derive a natural myopic active learning scheme that identifies the intervention which is optimally informative about all of the unknown functions jointly, given previously observed data. We test the derived algorithms on simple examples, to demonstrate that they produce a structured exploration policy that significantly improves on unstructured base-lines.
5 Early Signs And Symptoms Of Diabetes
Type diabetes is a condition that occurs when the body is not able to use the hormone insulin properly. It affects around 29 million Americans, and is caused by high blood sugar levels, often the result of poor diet and exercise. Here are some of the early signs and symptoms of this popular health disorder. According to Everyday Health, when there is extra glucose in the blood, the kidneys attempt to flush it out. As a result, patients will see an increase in urine production and the urge to use the bathroom.