Goto

Collaborating Authors

 Diagnosis


MVCNet: Multiview Contrastive Network for Unsupervised Representation Learning for 3D CT Lesions

arXiv.org Artificial Intelligence

\emph{Objective and Impact Statement}. With the renaissance of deep learning, automatic diagnostic systems for computed tomography (CT) have achieved many successful applications. However, they are mostly attributed to careful expert annotations, which are often scarce in practice. This drives our interest to the unsupervised representation learning. \emph{Introduction}. Recent studies have shown that self-supervised learning is an effective approach for learning representations, but most of them rely on the empirical design of transformations and pretext tasks. \emph{Methods}. To avoid the subjectivity associated with these methods, we propose the MVCNet, a novel unsupervised three dimensional (3D) representation learning method working in a transformation-free manner. We view each 3D lesion from different orientations to collect multiple two dimensional (2D) views. Then, an embedding function is learned by minimizing a contrastive loss so that the 2D views of the same 3D lesion are aggregated, and the 2D views of different lesions are separated. We evaluate the representations by training a simple classification head upon the embedding layer. \emph{Results}. Experimental results show that MVCNet achieves state-of-the-art accuracies on the LIDC-IDRI (89.55\%), LNDb (77.69\%) and TianChi (79.96\%) datasets for \emph{unsupervised representation learning}. When fine-tuned on 10\% of the labeled data, the accuracies are comparable to the supervised learning model (89.46\% vs. 85.03\%, 73.85\% vs. 73.44\%, 83.56\% vs. 83.34\% on the three datasets, respectively). \emph{Conclusion}. Results indicate the superiority of MVCNet in \emph{learning representations with limited annotations}.


Nanosecond machine learning event classification with boosted decision trees in FPGA for high energy physics

arXiv.org Artificial Intelligence

We present a novel implementation of classification using the machine learning / artificial intelligence method called boosted decision trees (BDT) on field programmable gate arrays (FPGA). The firmware implementation of binary classification requiring 100 training trees with a maximum depth of 4 using four input variables gives a latency value of about 10 ns, independent of the clock speed from 100 to 320 MHz in our setup. The low timing values are achieved by restructuring the BDT layout and reconfiguring its parameters. The FPGA resource utilization is also kept low at a range from 0.01% to 0.2% in our setup. A software package called fwXmachina achieves this implementation. Our intended user is an expert of custom electronics-based trigger systems in high energy physics experiments or anyone that needs decisions at the lowest latency values for real-time event classification. Two problems from high energy physics are considered, in the separation of electrons vs. photons and in the selection of vector boson fusion-produced Higgs bosons vs. the rejection of the multijet processes.


On the Explanatory Power of Decision Trees

arXiv.org Artificial Intelligence

Decision trees have long been recognized as models of choice in sensitive applications where interpretability is of paramount importance. In this paper, we examine the computational ability of Boolean decision trees in deriving, minimizing, and counting sufficient reasons and contrastive explanations. We prove that the set of all sufficient reasons of minimal size for an instance given a decision tree can be exponentially larger than the size of the input (the instance and the decision tree). Therefore, generating the full set of sufficient reasons can be out of reach. In addition, computing a single sufficient reason does not prove enough in general; indeed, two sufficient reasons for the same instance may differ on many features. To deal with this issue and generate synthetic views of the set of all sufficient reasons, we introduce the notions of relevant features and of necessary features that characterize the (possibly negated) features appearing in at least one or in every sufficient reason, and we show that they can be computed in polynomial time. We also introduce the notion of explanatory importance, that indicates how frequent each (possibly negated) feature is in the set of all sufficient reasons. We show how the explanatory importance of a feature and the number of sufficient reasons can be obtained via a model counting operation, which turns out to be practical in many cases. We also explain how to enumerate sufficient reasons of minimal size. We finally show that, unlike sufficient reasons, the set of all contrastive explanations for an instance given a decision tree can be derived, minimized and counted in polynomial time.


Regression Trees

#artificialintelligence

This Blog assumes that the reader is familiar with the concept of Decision Trees and Regression. If not, refer to the blogs below. Having read the above blogs or Having already being familiar with the appropriate topics, you hopefully understand what is a decision tree, by now ( The one we used for classification task). A regression tree is basically a decision tree that is used for the task of regression which can be used to predict continuous valued outputs instead of discrete outputs. In Decision Trees for Classification, we saw how the tree asks right questions at the right node in order to give accurate and efficient classifications.


Researchers use machine learning to modify the current PTSD diagnostic criteria - Mental Daily

#artificialintelligence

A group of researchers from the Boston University School of Public Health and the VA Boston Healthcare System utilized machine learning to streamline the diagnosis tool for post-traumatic stress disorder (PTSD). According to their new study, released in the journal Assessment, some of the questions imposed in the Structural Clinical Interview for the Diagnostic Statistical Manual of Mental Disorders, Fifth Edition (SCID-5) could be eliminated, leading to more relevancy of the veteran population. "Our study is only a first step--but an important one, because it shows that machine learning methods can be used to help inform efforts to make care more efficient, without sacrificing or degrading the quality of care provided," said co-author Jaimie Graudus, in a news release. The new research included data from the SCID-5 assessments related to more than 1,200 military soldiers, half of which were male and the rest female, who served during the Afghanistan and Iraq conflicts. The use of random forests, a form of machine-learning system, was also incorporated into the study.


Deep Metric Learning Model for Imbalanced Fault Diagnosis

arXiv.org Artificial Intelligence

Intelligent diagnosis method based on data-driven and deep learning is an attractive and meaningful field in recent years. However, in practical application scenarios, the imbalance of time-series fault is an urgent problem to be solved. This paper proposes a novel deep metric learning model, where imbalanced fault data and a quadruplet data pair design manner are considered. Based on such data pair, a quadruplet loss function which takes into account the inter-class distance and the intra-class data distribution are proposed. This quadruplet loss pays special attention to imbalanced sample pair. The reasonable combination of quadruplet loss and softmax loss function can reduce the impact of imbalance. Experiment results on two open-source datasets show that the proposed method can effectively and robustly improve the performance of imbalanced fault diagnosis.


Decision Tree Algorithms-Machine Learning

#artificialintelligence

Decision Tree Algorithm one of the easiest and popular Algorithms to predict the output. The Decision Tree Algorithm is a part of the supervised machine learning algorithm. Here the problem is represented in a form of a tree to predict the outcome. This algorithm aims to create a model that should predict the value of a variable that is targeted, and for this purpose, it is represented in a form of a decision tree. It is used for classification problems and also for regression problems.


Explainable AI (XAI) for PHM of Industrial Asset: A State-of-The-Art, PRISMA-Compliant Systematic Review

arXiv.org Artificial Intelligence

A state-of-the-art systematic review on XAI applied to Prognostic and Health Management (PHM) of industrial asset is presented. The work attempts to provide an overview of the general trend of XAI in PHM, answers the question of accuracy versus explainability, investigates the extent of human role, explainability evaluation and uncertainty management in PHM XAI. Research articles linked to PHM XAI, in English language, from 2015 to 2021 are selected from IEEE Xplore, ScienceDirect, SpringerLink, ACM Digital Library and Scopus databases using PRISMA guidelines. Data was extracted from 35 selected articles and examined using MS. Excel. Several findings were synthesized. Firstly, while the discipline is still young, the analysis indicates the growing acceptance of XAI in PHM domain. Secondly, XAI functions as a double edge sword, where it is assimilated as a tool to execute PHM tasks as well as a mean of explanation, in particular in diagnostic and anomaly detection. There is thus a need for XAI in PHM. Thirdly, the review shows that PHM XAI papers produce either good or excellent results in general, suggesting that PHM performance is unaffected by XAI. Fourthly, human role, explainability metrics and uncertainty management are areas requiring further attention by the PHM community. Adequate explainability metrics to cater for PHM need are urgently needed. Finally, most case study featured on the accepted articles are based on real, indicating that available AI and XAI approaches are equipped to solve complex real-world challenges, increasing the confidence of AI model adoption in the industry. This work is funded by the Universiti Teknologi Petronas Foundation.


Chest X-ray AI solution which detects 124 clinical findings launched

#artificialintelligence

The company was originally formed as a joint venture between Australian healthcare technology company Harrison.ai The launch of Annalise CXR coincides with its publication of a peer-reviewed diagnostic accuracy study published by The Lancet Digital Health, which is the largest of its kind ever undertaken in terms of the number of findings concurrently evaluated. The study found that when used as an assist device, Annalise CXR significantly improved the ability for radiologists to perceive 102 chest X-ray (CXR) findings in a non-clinical environment, was statistically non-inferior for 19 findings and no findings showed a decrease in accuracy. The study also assessed the standalone performance of the model in a non-clinical environment against radiologists in identifying chest x-ray pathology, as well as investigating the effect of model output on radiologist performance when used as an assist device. Annalise CXR's AI model classification alone was significantly more accurate than unassisted radiologists for 117 (94%) of 124 clinical findings predicted by the model and was non-inferior to unassisted radiologists for all other clinical findings.


The Hyperspherical Geometry of Community Detection: Modularity as a Distance

arXiv.org Machine Learning

The Louvain algorithm is currently one of the most popular community detection methods. This algorithm finds communities by maximizing a quantity called modularity. In this work, we describe a metric space of clusterings, where clusterings are described by a binary vector indexed by the vertex-pairs. We extend this geometry to a hypersphere and prove that maximizing modularity is equivalent to minimizing the angular distance to some modularity vector over the set of clustering vectors. This equivalence allows us to view the Louvain algorithm as a nearest-neighbor search that approximately minimizes the distance to this modularity vector. By replacing this modularity vector by a different vector, many alternative community detection methods can be obtained. We explore this wider class and compare it to existing modularity-based methods. Our experiments show that these alternatives may outperform modularity-based methods. For example, when communities are large compared to vertex neighborhoods, a vector based on numbers of common neighbors outperforms existing community detection methods. While the focus of the present work is community detection in networks, the proposed methodology can be applied to any clustering problem where pair-wise similarity data is available.