AITopics | Performance Analysis

Collaborating Authors

Performance Analysis

News Overviews Instructional Materials AI-Alerts Classics

SAIA: Split Artificial Intelligence Architecture for Mobile Healthcare System

Zhuang, Di, Nguyen, Nam, Chen, Keyu, Chang, J. Morris

arXiv.org Artificial IntelligenceMay-9-2020

As the advancement of deep learning (DL), the Internet of Things and cloud computing techniques for biomedical and healthcare problems, mobile healthcare systems have received unprecedented attention. Since DL techniques usually require enormous amount of computation, most of them cannot be directly deployed on the resource-constrained mobile and IoT devices. Hence, most of the mobile healthcare systems leverage the cloud computing infrastructure, where the data collected by the mobile and IoT devices would be transmitted to the cloud computing platforms for analysis. However, in the contested environments, relying on the cloud might not be practical at all times. For instance, the satellite communication might be denied or disrupted. We propose SAIA, a Split Artificial Intelligence Architecture for mobile healthcare systems. Unlike traditional approaches for artificial intelligence (AI) which solely exploits the computational power of the cloud server, SAIA could not only relies on the cloud computing infrastructure while the wireless communication is available, but also utilizes the lightweight AI solutions that work locally on the client side, hence, it can work even when the communication is impeded. In SAIA, we propose a meta-information based decision unit, that could tune whether a sample captured by the client should be operated by the embedded AI (i.e., keeping on the client) or the networked AI (i.e., sending to the server), under different conditions. In our experimental evaluation, extensive experiments have been conducted on two popular healthcare datasets. Our results show that SAIA consistently outperforms its baselines in terms of both effectiveness and efficiency.

data mining, decision unit, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2004.12059

Country:

North America > United States > Florida > Hillsborough County > Tampa (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Genre: Research Report > New Finding (0.86)

Industry:

Health & Medicine > Government Relations & Public Policy (1.00)
Health & Medicine > Therapeutic Area > Oncology (0.94)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Cloud Computing (1.00)
(6 more...)

Add feedback

Coronavirus Update: GOP Senators Disagree With Trump On COVID-19 Testing, 'There Are Still Shortfalls'

International Business TimesMay-8-2020, 14:05:40 GMT

Republican senators are saying out loud the extent of mass testing for COVID-19 in the United States isn't where it should be -- not by a long shot -- and contradict president Donald Trump's oft repeated claims the U.S. has so much testing available. "We have so much testing," claimed Trump Thursday. Mass testing is one of the only few known ways to end the COVID-19 pandemic in this country. The U.S. has conducted only 8.1 million tests since February. The White House says its goal is two million tests per week per state by the end of May.

artificial intelligence, gop senator disagree, machine learning, (10 more...)

International Business Times

Country: North America > United States (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.32)

Add feedback

In Pursuit of Interpretable, Fair and Accurate Machine Learning for Criminal Recidivism Prediction

Wang, Caroline, Han, Bin, Patel, Bhrij, Mohideen, Feroze, Rudin, Cynthia

arXiv.org Machine LearningMay-8-2020

In recent years, academics and investigative journalists have criticized certain commercial risk assessments for their black-box nature and failure to satisfy competing notions of fairness. Since then, the field of interpretable machine learning has created simple yet effective algorithms, while the field of fair machine learning has proposed various mathematical definitions of fairness. However, studies from these fields are largely independent, despite the fact that many applications of machine learning to social issues require both fairness and interpretability. We explore the intersection by revisiting the recidivism prediction problem using state-of-the-art tools from interpretable machine learning, and assessing the models for performance, interpretability, and fairness. Unlike previous works, we compare against two existing risk assessments (COMPAS and the Arnold Public Safety Assessment) and train models that output probabilities rather than binary predictions. We present multiple models that beat these risk assessments in performance, and provide a fairness analysis of these models. Our results imply that machine learning models should be trained separately for separate locations, and updated over time.

artificial intelligence, machine learning, recidivism, (16 more...)

arXiv.org Machine Learning

2005.04176

Country:

North America > United States > Kentucky (0.09)
North America > United States > Virginia (0.04)
North America > United States > Wisconsin (0.04)
(20 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

An Extensive Study on Cross-Dataset Bias and Evaluation Metrics Interpretation for Machine Learning applied to Gastrointestinal Tract Abnormality Classification

Thambawita, Vajira, Jha, Debesh, Hammer, Hugo Lewi, Johansen, Håvard D., Johansen, Dag, Halvorsen, Pål, Riegler, Michael A.

arXiv.org Machine LearningMay-8-2020

Precise and efficient automated identification of Gastrointestinal (GI) tract diseases can help doctors treat more patients and improve the rate of disease detection and identification. Currently, automatic analysis of diseases in the GI tract is a hot topic in both computer science and medical-related journals. Nevertheless, the evaluation of such an automatic analysis is often incomplete or simply wrong. Algorithms are often only tested on small and biased datasets, and cross-dataset evaluations are rarely performed. A clear understanding of evaluation metrics and machine learning models with cross datasets is crucial to bring research in the field to a new quality level. Towards this goal, we present comprehensive evaluations of five distinct machine learning models using Global Features and Deep Neural Networks that can classify 16 different key types of GI tract conditions, including pathological findings, anatomical landmarks, polyp removal conditions, and normal findings from images captured by common GI tract examination instruments. In our evaluation, we introduce performance hexagons using six performance metrics such as recall, precision, specificity, accuracy, F1-score, and Matthews Correlation Coefficient to demonstrate how to determine the real capabilities of models rather than evaluating them shallowly. Furthermore, we perform cross-dataset evaluations using different datasets for training and testing. With these cross-dataset evaluations, we demonstrate the challenge of actually building a generalizable model that could be used across different hospitals. Our experiments clearly show that more sophisticated performance metrics and evaluation methods need to be applied to get reliable models rather than depending on evaluations of the splits of the same dataset, i.e., the performance metrics should always be interpreted together rather than relying on a single metric.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Machine Learning

2005.03912

Country:

Europe > Norway > Eastern Norway > Oslo (0.05)
Europe > Norway > Northern Norway > Troms > Tromsø (0.04)
Africa > Cameroon > Far North Region > Maroua (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Gastroenterology (1.00)
Health & Medicine > Diagnostic Medicine (0.95)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.93)

Add feedback

The scalable Birth-Death MCMC Algorithm for Mixed Graphical Model Learning with Application to Genomic Data Integration

Wang, Nanwei, Briollais, Laurent, Massam, Helene

arXiv.org Machine LearningMay-8-2020

Recent advances in biological research have seen the emergence of high-throughput technologies with numerous applications that allow the study of biological mechanisms at an unprecedented depth and scale. A large amount of genomic data is now distributed through consortia like The Cancer Genome Atlas (TCGA), where specific types of biological information on specific type of tissue or cell are available. In cancer research, the challenge is now to perform integrative analyses of high-dimensional multi-omic data with the goal to better understand genomic processes that correlate with cancer outcomes, e.g. elucidate gene networks that discriminate a specific cancer subgroups (cancer sub-typing) or discovering gene networks that overlap across different cancer types (pan-cancer studies). In this paper, we propose a novel mixed graphical model approach to analyze multi-omic data of different types (continuous, discrete and count) and perform model selection by extending the Birth-Death MCMC (BDMCMC) algorithm initially proposed by \citet{stephens2000bayesian} and later developed by \cite{mohammadi2015bayesian}. We compare the performance of our method to the LASSO method and the standard BDMCMC method using simulations and find that our method is superior in terms of both computational efficiency and the accuracy of the model selection results. Finally, an application to the TCGA breast cancer data shows that integrating genomic information at different levels (mutation and expression data) leads to better subtyping of breast cancers.

artificial intelligence, bayesian inference, machine learning, (17 more...)

arXiv.org Machine Learning

2005.04139

Country:

North America > United States (0.14)
Europe > Middle East > Malta (0.04)

Genre: Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)

Add feedback

Compressing Large Sample Data for Discriminant Analysis

Lapanowski, Alexander F., Gaynanova, Irina

arXiv.org Machine LearningMay-8-2020

Large-sample data became prevalent as data acquisition became cheaper and easier. While a large sample size has theoretical advantages for many statistical methods, it presents computational challenges. Sketching, or compression, is a well-studied approach to address these issues in regression settings, but considerably less is known about its performance in classification settings. Here we consider the computational issues due to large sample size within the discriminant analysis framework. We propose a new compression approach for reducing the number of training samples for linear and quadratic discriminant analysis, in contrast to existing compression methods which focus on reducing the number of features. We support our approach with a theoretical bound on the misclassification error rate compared to the Bayes classifier. Empirical studies confirm the significant computational gains of the proposed method and its superior predictive ability compared to random sub-sampling.

artificial intelligence, bayesian inference, machine learning, (19 more...)

arXiv.org Machine Learning

2005.03858

Country:

North America > United States > Texas > Brazos County > College Station (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > New York (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.73)
(2 more...)

Add feedback

Super-App Behavioral Patterns in Credit Risk Models: Financial, Statistical and Regulatory Implications

Roa, Luisa, Correa-Bahnsen, Alejandro, Suarez, Gabriel, Cortés-Tejada, Fernando, Luque, María A., Bravo, Cristián

arXiv.org Machine LearningMay-8-2020

In this paper we present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models. These alternative data sources have shown themselves to be immensely powerful in predicting borrower behavior in segments traditionally underserved by banks and financial institutions. Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals, who are also the most likely to engage with alternative lenders. Furthermore, using the TreeSHAP method for Stochastic Gradient Boosting interpretation, our results also revealed interesting non-linear trends in the variables originating from the app, which would not normally be available to traditional banks. Our results represent an opportunity for technology companies to disrupt traditional banking by correctly identifying alternative data sources and handling this new information properly. At the same time alternative data must be carefully validated to overcome regulatory hurdles across diverse jurisdictions.

data mining, information, machine learning, (17 more...)

arXiv.org Machine Learning

2005.14658

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Switzerland > Basel-City > Basel (0.04)
South America > Uruguay > Maldonado > Maldonado (0.04)
(6 more...)

Genre: Research Report > New Finding (1.00)

Industry: Banking & Finance > Credit (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > e-Commerce > Financial Technology (0.96)
(3 more...)

Add feedback

Amazon researchers trained an AI model in multiple languages to improve product searches » techsocialnetwork

#artificialintelligenceMay-7-2020, 17:10:22 GMT

Amazon operates in 14 countries around the world, nine of which are eligible for its Prime yearly subscription service. It goes without saying that the company has a real desire to make available its shopping experience in any number of languages, particularly where customers who speak different dialects are searching for the same products. In pursuit of an efficient means of translating multiple languages, Amazon researchers devised a shopping model called a multitask model, in which the functions overlap across tasks and tend to reinforce each other. They say that their AI, which was trained on data from several different languages at once, delivered better results using any of those languages. As Amazon applied scientist Nikhil Rao explained in a blog post, the reason for the improvement is that a corpus in one language is able to fill gaps in that of another language.

artificial intelligence, machine learning, multiple language, (15 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

Add feedback

South Korean findings suggest 'reinfected' virus cases are false positives

The Japan TimesMay-7-2020, 09:45:04 GMT

SEOUL – South Korean health authorities raised new concerns about the novel coronavirus after reporting last month that dozens of patients who had recovered from the illness later tested positive again. The findings suggested that some people who survived COVID-19 could become reinfected with the virus that causes it, potentially complicating efforts to lift quarantine restrictions and to produce a vaccine. But after weeks of research, they now say that such test results appear to be "false positives" caused by lingering -- but likely not infectious -- bits of the virus. South Korea had reported more than 350 such cases as of Wednesday, according to the Korea Centers for Disease Control and Prevention (KCDC). As more and more South Koreans were released from treatment for COVID-19, authorities discovered a disturbing trend.

artificial intelligence, machine learning, virus, (7 more...)

The Japan Times

Country: Asia > South Korea > Seoul > Seoul (0.29)

Genre: Research Report > New Finding (0.71)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Epidemiology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.65)

Add feedback

Predictive Modeling of ICU Healthcare-Associated Infections from Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling Approach

Sánchez-Hernández, Fernando, Ballesteros-Herráez, Juan Carlos, Kraiem, Mohamed S., Sánchez-Barba, Mercedes, Moreno-García, María N.

arXiv.org Machine LearningMay-7-2020

Early detection of patients vulnerable to infections acquired in the hospital environment is a challenge in current health systems given the impact that such infections have on patient mortality and healthcare costs. This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units by means of machine-learning methods. The aim is to support decision making addressed at reducing the incidence rate of infections. In this field, it is necessary to deal with the problem of building reliable classifiers from imbalanced datasets. We propose a clustering-based undersampling strategy to be used in combination with ensemble classifiers. A comparative study with data from 4616 patients was conducted in order to validate our proposal. We applied several single and ensemble classifiers both to the original dataset and to data preprocessed by means of different resampling methods. The results were analyzed by means of classic and recent metrics specifically designed for imbalanced data classification. They revealed that the proposal is more efficient in comparison with other approaches.

artificial intelligence, classifier, machine learning, (17 more...)

arXiv.org Machine Learning

doi: 10.3390/app9245287

2005.03582

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Oceania > New Zealand > North Island > Waikato > Hamilton (0.04)
North America > United States > Tennessee > Davidson County > Nashville (0.04)
(15 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Providers & Services (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.96)

Add feedback