AITopics

2109.15063

Country:

Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.04)
Europe > Greece > Attica > Athens (0.04)

Genre:

Workflow (1.00)
Research Report > Promising Solution (0.65)

Industry:

Health & Medicine > Surgery (1.00)
Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision > Image Understanding (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Crespi, Leonardo, Loiacono, Daniele, Chiti, Arturo

Chest X-Rays Image Classification from beta-Variational Autoencoders Latent Features

arXiv.org Artificial Intelligence Sep-29-2021

Chest X-Ray (CXR) is one of the most common diagnostic techniques used in everyday clinical practice around the world. We present a work that investigates the use of Deep Learning (DL) techniques to extract information from such images and classify them, keeping our methodology as general as possible and, ideally, usable in a real-world scenario without much effort in the future. To this end, we trained several beta-Variational Autoencoder (beta-VAE) models on the CheXpert dataset, one of the largest publicly available collections of labeled CXR images; latent features were extracted from these models and used to train other Machine Learning models that classify the original images from the features extracted by the beta-VAE. Lastly, tree-based models were combined into ensembles to improve the results without further training or model engineering. While we expected some drop in raw performance with respect to state-of-the-art classification-specific models, we obtained encouraging results, which show the viability of our approach and the usability of the high-level features extracted by the autoencoders for classification tasks.
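The beta-VAE objective that produces such latent features can be sketched without any deep learning framework. This is a minimal illustration, assuming a diagonal Gaussian encoder and a squared-error reconstruction term; the function names and the default value of `beta` are hypothetical and are not taken from the paper.

```python
import math

def kl_diag_gaussian(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ), summed over latent dimensions."""
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0 for m, lv in zip(mu, log_var))

def beta_vae_loss(x, x_recon, mu, log_var, beta=4.0):
    """Squared-error reconstruction plus a beta-weighted KL term.

    beta=4.0 is an assumed illustrative value, not the paper's setting.
    """
    recon = sum((a - b) ** 2 for a, b in zip(x, x_recon))
    return recon + beta * kl_diag_gaussian(mu, log_var)
```

In a pipeline like the one described, the encoder mean `mu` would plausibly serve as the feature vector handed to the downstream classifiers; that choice is also an assumption here.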

artificial intelligence, deep learning, machine learning, (15 more...)

doi: 10.1109/SSCI50451.2021.9660190

2109.1476

Country:

Europe > Italy > Lombardy > Milan (0.05)
North America > United States (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Schwab, Evan, Cula, Gabriela Oana, Standish, Kristopher, Yip, Stephen S. F., Stojmirovic, Aleksandar, Ghanem, Louis, Chehoud, Christel

Automatic Estimation of Ulcerative Colitis Severity from Endoscopy Videos using Ordinal Multi-Instance Learning

arXiv.org Artificial Intelligence Sep-29-2021

Ulcerative colitis (UC) is a chronic inflammatory bowel disease characterized by relapsing inflammation of the large intestine. The severity of UC is often represented by the Mayo Endoscopic Subscore (MES) which quantifies mucosal disease activity from endoscopy videos. In clinical trials, an endoscopy video is assigned an MES based upon the most severe disease activity observed in the video. For this reason, severe inflammation spread throughout the colon will receive the same MES as an otherwise healthy colon with severe inflammation restricted to a small, localized segment. Therefore, the extent of disease activity throughout the large intestine, and overall response to treatment, may not be completely captured by the MES. In this work, we aim to automatically estimate UC severity for each frame in an endoscopy video to provide a higher resolution assessment of disease activity throughout the colon. Because annotating severity at the frame-level is expensive, labor-intensive, and highly subjective, we propose a novel weakly supervised, ordinal classification method to estimate frame severity from video MES labels alone. Using clinical trial data, we first achieved 0.92 and 0.90 AUC for predicting mucosal healing and remission of UC, respectively. Then, for severity estimation, we demonstrate that our models achieve substantial Cohen's Kappa agreement with ground truth MES labels, comparable to the inter-rater agreement of expert clinicians. These findings indicate that our framework could serve as a foundation for novel clinical endpoints, based on a more localized scoring system, to better evaluate UC drug efficacy in clinical trials.
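The limitation of a video-level MES described above can be made concrete with a small sketch. It assumes frame scores on the usual 0 to 3 MES scale; `severe_fraction` is a hypothetical extent summary introduced for illustration and is not the paper's proposed endpoint.

```python
def video_mes(frame_scores):
    """Video-level MES under the 'most severe frame' convention: the maximum frame score."""
    return max(frame_scores)

def severe_fraction(frame_scores, level=3):
    """Fraction of frames at or above a severity level (hypothetical extent summary)."""
    return sum(s >= level for s in frame_scores) / len(frame_scores)

diffuse = [3, 3, 2, 3, 3, 3]    # severe inflammation throughout the colon
localized = [0, 0, 3, 0, 0, 0]  # one severe segment, otherwise healthy
```

Both toy videos receive the same video-level score, while the per-frame summary separates them, which is the motivation for estimating severity frame by frame.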

agreement, classification, video, (14 more...)

2109.14685

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Gastroenterology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

arXiv.org Machine Learning Sep-29-2021

Deep neural networks with controlled variable selection for the identification of putative causal genetic variants

Kassani, Peyman H., Lu, Fred, Le Guen, Yann, He, Zihuai

Deep neural networks (DNN) have been used successfully in many scientific problems for their high prediction accuracy, but their application to genetic studies remains challenging due to their poor interpretability. In this paper, we consider the problem of scalable, robust variable selection in DNN for the identification of putative causal genetic variants in genome sequencing studies. We identified a pronounced randomness in feature selection in DNN due to its stochastic nature, which may hinder interpretability and give rise to misleading results. We propose an interpretable neural network model, stabilized using ensembling, with controlled variable selection for genetic studies. The merits of the proposed method include: (1) flexible modelling of the non-linear effect of genetic variants to improve statistical power; (2) multiple knockoffs in the input layer to rigorously control false discovery rate; (3) hierarchical layers to substantially reduce the number of weight parameters and activations to improve computational efficiency; (4) de-randomized feature selection to stabilize identified signals. We evaluated the proposed method in extensive simulation studies and applied it to the analysis of Alzheimer's disease genetics. We showed that the proposed method, when compared to conventional linear and nonlinear methods, can lead to substantially more discoveries.

Introduction

Recent advances in whole genome sequencing (WGS) technology have led the way to explore the contribution of common and rare variants in both coding and non-coding regions towards risk for complex traits. Large-scale genome sequencing studies, such as the Trans-Omics for Precision Medicine (TOPMed) study and the Alzheimer's Disease Sequencing Project (ADSP), have collected thousands of samples with directly sequenced whole genomes. Genetic variants or genes below a p-value threshold are deemed as associated variants. The marginal association tests are well-known for their simplicity and effectiveness, but they often identify proxy variants that are only correlated with the true causal variants, and the statistical power can be suboptimal. One obstacle for the widespread application of DNN to genetic data is their interpretability.
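The knockoff-based false discovery rate control mentioned in the abstract can be illustrated with the standard single-knockoff "knockoff+" selection rule. This is a simplified stand-in, not the multiple-knockoff procedure the paper describes: it assumes one importance statistic `W_j` per variant, where large positive values favour the original variant over its knockoff copy. The function names are hypothetical.

```python
def knockoff_plus_threshold(w, q=0.1):
    """Smallest t among the nonzero |W_j| such that
    (1 + #{W_j <= -t}) / max(1, #{W_j >= t}) <= q; infinity if none qualifies."""
    for t in sorted({abs(v) for v in w if v != 0}):
        neg = sum(v <= -t for v in w)
        pos = sum(v >= t for v in w)
        if (1 + neg) / max(1, pos) <= q:
            return t
    return float("inf")

def select_variants(w, q=0.1):
    """Indices of variants whose statistic reaches the knockoff+ threshold."""
    t = knockoff_plus_threshold(w, q)
    return [j for j, v in enumerate(w) if v >= t]
```

The negative statistics act as an estimate of how many false positives sit among the positive ones, so a stricter target level `q` pushes the threshold up and can leave nothing selected.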

genetic variant, hide-mk, variant, (16 more...)

2109.14719

Country:

North America > United States > California > Santa Clara County > Stanford (0.04)
North America > United States > Michigan (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.88)

Industry:

Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Mary-Huard, Tristan, Perduca, Vittorio, Blanchard, Gilles, Martin-Magniette, Marie-Laure

Error rate control for classification rules in multiclass mixture models

arXiv.org Machine Learning Sep-29-2021

In the context of finite mixture models, one considers the problem of classifying as many observations as possible in the classes of interest while controlling the classification error rate in these same classes. Similar to what is done in the framework of statistical test theory, different type I and type II-like classification error rates can be defined, along with their associated optimal rules, where optimality is defined as minimizing type II error rate while controlling type I error rate at some nominal level. It is first shown that finding an optimal classification rule boils down to searching for an optimal region in the observation space in which to apply the classical Maximum A Posteriori (MAP) rule. Depending on the misclassification rate to be controlled, the shape of the optimal region is provided, along with a heuristic to compute the optimal classification rule in practice. In particular, a multiclass FDR-like optimal rule is defined and compared to the thresholded MAP rule that is used in most applications. It is shown on both simulated and real datasets that the FDR-like optimal rule may be significantly less conservative than the thresholded MAP rule.
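The thresholded MAP baseline referred to above can be sketched in a few lines. This is one common reading of it, assuming that an observation is left unclassified when its largest posterior probability falls below a fixed cutoff; the function name and the default cutoff are hypothetical.

```python
def thresholded_map(posteriors, threshold=0.9):
    """Return the index of the MAP class if its posterior reaches the threshold,
    otherwise None (the observation is left unclassified)."""
    best = max(range(len(posteriors)), key=lambda k: posteriors[k])
    return best if posteriors[best] >= threshold else None
```

A fixed cutoff like this controls errors observation by observation, which is why a rule calibrated on an aggregate error rate may classify more observations at the same nominal level.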

classification rule, optimal rule, posterior probability, (14 more...)

2109.14235

Country: North America > United States > New York (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

#artificialintelligence Sep-28-2021, 11:35:12 GMT

Artificial Intelligence System Improves Breast Cancer Detection

Breast cancer is the second most common cancer among women in the United States; as of January 2021, there are more than 3.8 million women with a history of breast cancer in the United States. Doctors often use ultrasound, mammograms, MRI, or biopsy to find or diagnose breast cancer. In a new study, researchers from NYU and NYU Abu Dhabi (NYUAD) report that they have developed a novel artificial intelligence (AI) system that achieves radiologist-level accuracy in identifying breast cancer in ultrasound images. Their findings are published in the journal Nature Communications, in a paper titled "Artificial intelligence system reduces false-positive findings in the interpretation of breast ultrasound exams." The study was led by Farah Shamout, PhD, NYUAD assistant professor emerging scholar of computer engineering, and colleagues. "Though consistently shown to detect mammographically occult cancers, breast ultrasound has been noted to have high false-positive rates," the researchers wrote. "In this work, we present an AI system that achieves radiologist-level accuracy in identifying breast cancer in ultrasound images." "The AI system was developed and evaluated using the NYU Breast Ultrasound Dataset consisting of 5,442,907 images within 288,767 breast exams (including both screening and diagnostic exams) collected from 143,203 patients examined between 2012 and 2019 at NYU Langone Health in New York," noted the researchers. The primary goal of the AI system is to reduce the frequency of false-positive findings. It can detect cancer by assigning a probability for malignancy and highlight parts of ultrasound images that are associated with its predictions. When the researchers conducted a reader study to compare its diagnostic accuracy with board-certified breast radiologists, the system achieved higher accuracy than the ten radiologists on average.
However, a hybrid model that aggregated the predictions of the AI system and radiologists achieved the best results in accurately detecting cancer in patients. "Our findings highlight the potential of AI to improve the accuracy, consistency, and efficiency of breast ultrasound diagnosis," explained Shamout. "Importantly, AI is not a replacement for the expertise of clinicians.

ai system, cancer, radiologist, (11 more...)

#artificialintelligence

Country:

North America > United States > New York (0.26)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.26)

Genre: Research Report > New Finding (0.58)

Industry: Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Seethi, Venkata Devesh Reddy, LaCasse, Zane, Chivte, Prajkta, Gaillard, Elizabeth R., Bharti, Pratool

An Explainable-AI approach for Diagnosis of COVID-19 using MALDI-ToF Mass Spectrometry

arXiv.org Artificial Intelligence Sep-28-2021

The novel severe acute respiratory syndrome coronavirus type-2 (SARS-CoV-2) caused a global pandemic that has taken more than 4.5 million lives and severely affected the global economy. To curb the spread of the virus, accurate, cost-effective, and quick testing for large populations is exceedingly important in order to identify, isolate, and treat infected people. Current testing methods commonly use PCR (Polymerase Chain Reaction) based equipment that has limitations on throughput, cost-effectiveness, and simplicity of procedure, which creates a compelling need for developing additional coronavirus disease-2019 (COVID-19) testing mechanisms that are highly sensitive, rapid, trustworthy, and convenient to use by the public. We propose a COVID-19 testing method using artificial intelligence (AI) techniques on MALDI-ToF (matrix-assisted laser desorption/ionization time-of-flight) data extracted from 152 human gargle samples (60 COVID-19 positive tests and 92 COVID-19 negative tests). Our AI-based approach leverages explainable-AI (X-AI) methods to explain the decision rules behind the predictive algorithm both on a local (per-sample) and global (all-samples) basis to make the AI model more trustworthy. Finally, we evaluated our proposed method using a 70%-30% train-test-split strategy and achieved a training accuracy of 86.79% and a testing accuracy of 91.30%.
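The reported accuracies can be translated into approximate sample counts. The sketch below is a back-of-the-envelope check, assuming the 30% test share of 152 samples is rounded to 46; the split sizes and the counts of correct predictions are inferred here and are not stated in the abstract.

```python
n_total = 152                    # gargle samples reported in the abstract
n_test = round(0.30 * n_total)   # 45.6 rounds to 46 (assumed rounding)
n_train = n_total - n_test       # 106

def implied_correct(accuracy_pct, n):
    """Number of correct predictions implied by an accuracy percentage on n samples."""
    return round(accuracy_pct / 100 * n)

test_correct = implied_correct(91.30, n_test)
train_correct = implied_correct(86.79, n_train)
print(test_correct, n_test, train_correct, n_train)
```

Under these assumptions the figures correspond to 42 of 46 test samples and 92 of 106 training samples, so a single test sample shifts the testing accuracy by roughly 2.2 percentage points.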

algorithm, feature importance, prediction, (17 more...)

2109.14099

Country: North America > United States > Illinois > DeKalb County > DeKalb (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.84)
(3 more...)

arXiv.org Machine Learning Sep-28-2021

Non-stationary Gaussian process discriminant analysis with variable selection for high-dimensional functional data

Yu, W, Wade, S, Bondell, H D, Azizi, L

High-dimensional classification and feature selection tasks are ubiquitous with the recent advancement in data acquisition technology. In several application areas such as biology, genomics and proteomics, the data are often functional in their nature and exhibit a degree of roughness and non-stationarity. These structures pose additional challenges to commonly used methods that rely mainly on a two-stage approach performing variable selection and classification separately. We propose in this work a novel Gaussian process discriminant analysis (GPDA) that combines these steps in a unified framework. Our model is a two-layer non-stationary Gaussian process coupled with an Ising prior to identify differentially-distributed locations. Scalable inference is achieved via developing a variational scheme that exploits advances in the use of sparse inverse covariance matrices. We demonstrate the performance of our methodology on simulated datasets and two proteomics datasets: breast cancer and SARS-CoV-2. Our approach distinguishes itself by offering explainability as well as uncertainty quantification in addition to low computational cost, which are crucial to increase trust and social acceptance of data-driven tools.

discriminant analysis, supplementary material, variable selection, (14 more...)

2109.14171

Genre: Research Report (0.50)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.89)
Health & Medicine > Therapeutic Area > Oncology (0.89)
Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (0.57)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
(4 more...)

arXiv.org Machine Learning Sep-28-2021

Federated Learning Algorithms for Generalized Mixed-effects Model (GLMM) on Horizontally Partitioned Data from Distributed Sources

Li, Wentao, Tong, Jiayi, Anjum, Md. Monowar, Mohammed, Noman, Chen, Yong, Jiang, Xiaoqian

Objectives: This paper develops two algorithms to achieve federated generalized linear mixed effect models (GLMM), and compares the developed models' outcomes with each other, as well as with those from the standard R package ('lme4'). Methods: The log-likelihood function of GLMM is approximated by two numerical methods (Laplace approximation and Gaussian-Hermite approximation), which support federated decomposition of GLMM to bring computation to data. Results: Our developed method can handle GLMM to accommodate hierarchical data with multiple non-independent levels of observations in a federated setting. The experimental results demonstrate comparable (Laplace) and superior (Gaussian-Hermite) performance with simulated and real-world data. Conclusion: We developed and compared federated GLMMs with different approximations, which can support researchers in analyzing biomedical data to accommodate mixed effects and address non-independence due to hierarchical structures (e.g., institutes, region, country).
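The idea of approximating the GLMM log-likelihood by quadrature so that it decomposes across sites can be sketched as follows. This is a toy illustration and not the paper's algorithm: it assumes a random-intercept logistic model with intercept `beta0` and random-effect standard deviation `sigma`, a fixed three-point Gauss-Hermite rule, and sites that each hold whole clusters. All function names are hypothetical.

```python
import math

# Three-point Gauss-Hermite rule for integrals of exp(-x^2) * f(x).
NODES = (-math.sqrt(1.5), 0.0, math.sqrt(1.5))
WEIGHTS = (math.sqrt(math.pi) / 6, 2 * math.sqrt(math.pi) / 3, math.sqrt(math.pi) / 6)

def expect_normal(f, sigma):
    """Approximate E[f(b)] for b ~ N(0, sigma^2) using the substitution b = sqrt(2)*sigma*x."""
    total = sum(w * f(math.sqrt(2) * sigma * x) for x, w in zip(NODES, WEIGHTS))
    return total / math.sqrt(math.pi)

def site_log_lik(clusters, beta0, sigma):
    """Marginal log-likelihood of one site's clusters (lists of 0/1 outcomes)."""
    total = 0.0
    for ys in clusters:
        def cond_lik(b, ys=ys):
            # Likelihood of the cluster's outcomes given its random intercept b.
            p = 1.0 / (1.0 + math.exp(-(beta0 + b)))
            return math.prod(p if y == 1 else 1.0 - p for y in ys)
        total += math.log(expect_normal(cond_lik, sigma))
    return total

def federated_log_lik(sites, beta0, sigma):
    """Each site contributes only a scalar; the coordinator adds the contributions."""
    return sum(site_log_lik(clusters, beta0, sigma) for clusters in sites)
```

Because the approximate log-likelihood is a sum over clusters, the federated total matches what a pooled analysis would compute for the same parameters, without any site sharing patient-level rows. A real implementation would also need gradients or an optimizer, and more quadrature nodes.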

approximation, glmm, laplace approximation, (14 more...)

2109.14046

Country:

North America > United States > Pennsylvania (0.04)
North America > United States > Louisiana > Saint John the Baptist Parish > Laplace (0.04)
North America > Canada > Manitoba > Winnipeg Metropolitan Region > Winnipeg (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.93)
Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (0.68)
Health & Medicine > Government Relations & Public Policy (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.96)

Guggilam, Sreelekha, Chandola, Varun, Patra, Abani

Anomaly Detection for High-Dimensional Data Using Large Deviations Principle

arXiv.org Machine Learning Sep-28-2021

Most current anomaly detection methods suffer from the curse of dimensionality when dealing with high-dimensional data. We propose an anomaly detection algorithm that can scale to high-dimensional data using concepts from the theory of large deviations. The proposed Large Deviations Anomaly Detection (LAD) algorithm is shown to outperform state-of-the-art anomaly detection methods on a variety of large and high-dimensional benchmark data sets. Exploiting the ability of the algorithm to scale to high-dimensional data, we propose an online anomaly detection method to identify anomalies in a collection of multivariate time series. We demonstrate the applicability of the online algorithm in identifying counties in the United States with anomalous trends in terms of COVID-19 related cases and deaths. Several of the identified anomalous counties correlate with counties with documented poor response to the COVID pandemic.

dataset, detection, time series, (11 more...)

2109.13698

Country:

North America > United States > New York > Erie County > Buffalo (0.04)
North America > United States > Michigan > Wayne County > Wayne (0.04)
North America > United States > Wyoming > Albany County > Laramie (0.04)
(10 more...)

Genre: Research Report (0.40)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.88)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)