AITopics

2207.04574

Country: North America > United States > Washington > King County > Seattle (0.14)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

arXiv.org Artificial Intelligence, Jul-20-2022

Comparative Study on Supervised versus Semi-supervised Machine Learning for Anomaly Detection of In-vehicle CAN Network

Dong, Yongqi, Chen, Kejia, Peng, Yinxuan, Ma, Zhiyuan

As the central nerve of the intelligent vehicle control system, the in-vehicle network bus is crucial to the security of vehicle driving. One of the best standards for the in-vehicle network is the Controller Area Network (CAN bus) protocol. However, the CAN bus is vulnerable to various attacks because its design lacks security mechanisms. To enhance the security of in-vehicle networks and promote research in this area, this study uses a large-scale CAN network traffic dataset with extracted features to comprehensively compare fully-supervised and semi-supervised machine learning methods for CAN message anomaly detection. Both traditional machine learning models (including single-classifier and ensemble models) and neural-network-based deep learning models are evaluated. Furthermore, this study proposes a deep-autoencoder-based semi-supervised learning method for CAN message anomaly detection and verifies its superiority over other semi-supervised methods. Extensive experiments show that the fully-supervised methods generally outperform semi-supervised ones, as they use more information as input. In particular, the developed XGBoost-based model obtained state-of-the-art performance with the best accuracy (98.65%), precision (0.9853), and ROC AUC (0.9585), beating other methods reported in the literature.

algorithm, anomaly detection, detection, (12 more...)

2207.10286

Country:

Asia > China > Shanghai > Shanghai (0.05)
Europe > Netherlands > South Holland > Delft (0.04)
Europe > United Kingdom (0.04)

Genre: Research Report (0.65)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.95)

Bateni, Arash, Chan, Matthew C., Eitel-Porter, Ray

AI Fairness: from Principles to Practice

arXiv.org Artificial Intelligence, Jul-20-2022

This paper summarizes and evaluates various approaches, methods, and techniques for pursuing fairness in artificial intelligence (AI) systems. It examines the merits and shortcomings of these measures and proposes practical guidelines for defining, measuring, and preventing bias in AI. In particular, it cautions against some of the simplistic, yet common, methods for evaluating bias in AI systems, and offers more sophisticated and effective alternatives. The paper also addresses widespread controversies and confusions in the field by providing a common language among different stakeholders of high-impact AI systems. It describes various trade-offs involving AI fairness, and provides practical recommendations for balancing them. It offers techniques for evaluating the costs and benefits of fairness targets, and defines the role of human judgment in setting these targets. This paper provides discussions and guidelines for AI practitioners, organization leaders, and policymakers, as well as various links to additional materials for a more technical audience. Numerous real-world examples are provided to clarify the concepts, challenges, and recommendations from a practical perspective.

accuracy, fairness, training data, (15 more...)

2207.09833

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
North America > United States > Minnesota (0.04)
(2 more...)

Genre: Research Report (0.83)

Industry:

Law > Civil Rights & Constitutional Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance (1.00)
Law > Labor & Employment Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
Information Technology > Artificial Intelligence > Applied AI (0.67)

Learning from few examples: Classifying sex from retinal images via deep learning

Berk, Aaron, Ozturan, Gulcenur, Delavari, Parsa, Maberley, David, Yılmaz, Özgür, Oruc, Ipek

Deep learning has seen tremendous interest in medical imaging, particularly in the use of convolutional neural networks (CNNs) for developing automated diagnostic tools. The facility of its non-invasive acquisition makes retinal fundus imaging amenable to such automated approaches. Recent work in analyzing fundus images using CNNs relies on access to massive data for training and validation - hundreds of thousands of images. However, data residency and data privacy restrictions stymie the applicability of this approach in medical settings where patient confidentiality is a mandate. Here, we showcase results for the performance of deep learning (DL) on small datasets to classify patient sex from fundus images - a trait thought not to be present or quantifiable in fundus images until recently. We fine-tune a ResNet-152 model whose last layer has been modified for binary classification. In several experiments, we assess performance in the small dataset context using one private (DOVS) and one public (ODIR) data source. Our models, developed using approximately 2500 fundus images, achieved test AUC scores of up to 0.72 (95% CI: [0.67, 0.77]). This corresponds to a mere 25% decrease in performance despite a nearly 1000-fold decrease in the dataset size compared to prior work in the literature. Even with a hard task like sex categorization from retinal images, we find that classification is possible with very small datasets. Additionally, we perform domain adaptation experiments between DOVS and ODIR; explore the effect of data curation on training and generalizability; and investigate model ensembling to maximize CNN classifier performance in the context of small development datasets.

artificial intelligence, deep learning, machine learning, (17 more...)

2207.09624

Country:

North America > Canada > Quebec > Montreal (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > United States > New York > New York County > New York City (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Yan, Bobby, Seto, Skyler, Apostoloff, Nicholas

FORML: Learning to Reweight Data for Fairness

Machine learning models are trained to minimize the mean loss for a single metric, and thus typically do not consider fairness and robustness. Neglecting such metrics in training can make these models prone to fairness violations when training data are imbalanced or test distributions differ. This work introduces Fairness Optimized Reweighting via Meta-Learning (FORML), a training algorithm that balances fairness and robustness with accuracy by jointly learning training sample weights and neural network parameters. The approach increases model fairness by learning to balance the contributions from both over- and under-represented sub-groups through dynamic reweighting of the data learned from a user-specified held-out set representative of the distribution under which fairness is desired. FORML improves equality of opportunity fairness criteria on image classification tasks, reduces bias of corrupted labels, and facilitates building more fair datasets via data condensation. These improvements are achieved without pre-processing data or post-processing model outputs, without learning an additional weighting function, without changing model architecture, and while maintaining accuracy on the original predictive metric.
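As a reading aid for the equality of opportunity criterion mentioned in this abstract, the following is a minimal sketch of how the gap in true-positive rate between sub-groups could be measured. It is not the FORML algorithm; the function names, the group labels, and the toy data are illustrative assumptions that do not come from the paper.

```python
# Sketch only: measures an equality-of-opportunity gap, it does not
# implement FORML. All names and data below are hypothetical.

def true_positive_rate(y_true, y_pred):
    """Fraction of actual positives (label 1) that were predicted 1."""
    preds_on_positives = [p for t, p in zip(y_true, y_pred) if t == 1]
    if not preds_on_positives:
        raise ValueError("group has no positive examples")
    return sum(preds_on_positives) / len(preds_on_positives)


def equal_opportunity_gap(y_true, y_pred, group):
    """Return (largest TPR difference across groups, per-group TPRs)."""
    rates = {}
    for g in set(group):
        idx = [i for i, gi in enumerate(group) if gi == g]
        rates[g] = true_positive_rate(
            [y_true[i] for i in idx], [y_pred[i] for i in idx]
        )
    return max(rates.values()) - min(rates.values()), rates


# Toy data: group "a" has four positives, group "b" has two.
y_true = [1, 1, 1, 1, 0, 0, 1, 1]
y_pred = [1, 1, 1, 0, 0, 1, 1, 0]
group = ["a", "a", "a", "a", "a", "b", "b", "b"]

gap, rates = equal_opportunity_gap(y_true, y_pred, group)
print(gap, rates)
```

A gap of zero would mean every sub-group's positives are detected at the same rate; a reweighting method would aim to shrink this number without hurting overall accuracy.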

artificial intelligence, forml, machine learning, (15 more...)

2202.01719

Country: North America > United States > Maryland > Baltimore (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.51)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)

Abukasis, Danit Shifman, Cohen, Izack, Xian, Xiaochen, Huang, Kejun, Singer, Gonen

Adaptive Learning for the Resource-Constrained Classification Problem

Classification applications are typically associated with misclassification costs and benefits as a result of incorrect and correct classification, respectively. Many studies have focused on cost-sensitive classification approaches [7, 8, 9, 10, 11, 12] in an effort to reduce the costs of misclassification. We illustrate the concept of imbalanced misclassification costs using the current and real-world example of classifying COVID-19 patients. Incorrectly classifying an ill patient as healthy may put this patient's life at risk as well as others by allowing the ill person to circulate among healthy persons and infect them (an intangible cost, usually determined by the judicial system). Classifying a healthy individual as a COVID-19 patient, on the other hand, may lead to unnecessary treatment, misuse of medical resources and cause unnecessary financial hardship to the individual and the general economy. Many studies have applied cost-sensitive approaches to handling imbalanced classification problems [13, 14] where the decision maker is interested in detecting the positive cases. 
There are four main approaches for making a classifier cost-sensitive: (i) changing the distribution of classes using over- and under-sampling within the training data set (i.e., preprocessing of the training data) to reduce misclassification costs [7, 8], denoted hereafter approach A1; (ii) changing the data set according to the misclassified samples of the cost-insensitive classifiers and their error costs (post-processing the training data) using a boosting approach in ensemble learning methods [12, 15], denoted hereafter approach A2; (iii) incorporating meta-learning methods on outputs of cost-insensitive learners using threshold-driven techniques in favor of utilizing the probability estimations for the classes [7, 8, 16, 17], hereafter denoted A3; (iv) directly incorporating cost-sensitive capabilities into a learning algorithm, i.e., an algorithm-level solution that adapts existing learning methods so they are biased towards classes with high misclassification costs, usually represented by minority classes [8, 18].
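The threshold-driven idea behind approach A3 can be illustrated with a minimal sketch. It assumes a binary problem, zero cost for correct decisions, and a calibrated probability estimate from a cost-insensitive learner; the function names and the cost values are hypothetical and are not taken from the paper.

```python
# Sketch only: cost-sensitive decision threshold on top of a
# cost-insensitive probability estimate. Names and costs are hypothetical.

def cost_threshold(cost_fp, cost_fn):
    """Probability above which predicting 'positive' has lower expected cost.

    Predicting positive costs (1 - p) * cost_fp in expectation and
    predicting negative costs p * cost_fn, so positive is preferred
    when p >= cost_fp / (cost_fp + cost_fn).
    """
    return cost_fp / (cost_fp + cost_fn)


def classify(prob_positive, cost_fp, cost_fn):
    """Return 1 (positive) or 0 (negative) under the cost-based threshold."""
    return int(prob_positive >= cost_threshold(cost_fp, cost_fn))


# Assumed costs: missing an ill patient is nine times worse than a false alarm.
COST_FP, COST_FN = 1.0, 9.0
for p in (0.05, 0.2, 0.6):
    print(p, classify(p, COST_FP, COST_FN))
```

With these assumed costs the threshold drops well below 0.5, so a patient with only a modest estimated probability of illness is still flagged as positive.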

artificial intelligence, machine learning, resource constraint, (18 more...)

doi: 10.1016/j.engappai.2022.105741

2207.09196

Country:

North America > United States > Florida > Alachua County > Gainesville (0.14)
North America > United States > New York (0.04)
Asia > Middle East > Israel (0.04)

Genre:

Research Report > Experimental Study (0.46)
Research Report > New Finding (0.46)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.96)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.97)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Das, Sandipan, Javid, Alireza M., Gohain, Prakash Borpatra, Eldar, Yonina C., Chatterjee, Saikat

Neural Greedy Pursuit for Feature Selection

arXiv.org Machine Learning, Jul-19-2022

We propose a greedy algorithm to select $N$ important features among $P$ input features for a non-linear prediction problem. The features are selected one by one sequentially, in an iterative loss minimization procedure. We use neural networks as predictors in the algorithm to compute the loss and hence, we refer to our method as neural greedy pursuit (NGP). NGP is efficient in selecting $N$ features when $N \ll P$, and it provides a notion of feature importance in a descending order following the sequential selection procedure. We experimentally show that NGP provides better performance than several feature selection methods such as DeepLIFT and Drop-one-out loss. In addition, we experimentally show a phase transition behavior in which perfect selection of all $N$ features without false positives is possible when the training data size exceeds a threshold.
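The sequential selection loop described in this abstract can be sketched as follows. To keep the example short and self-contained, a linear least-squares fit is swapped in for the neural network predictor, and the loss is measured on the training data; both choices, along with all function names and the synthetic data, are assumptions for illustration and not the authors' implementation.

```python
# Sketch only: greedy forward feature selection with a least-squares
# predictor standing in for the neural network used by NGP.
import numpy as np


def fit_loss(X, y):
    """Mean squared error of a least-squares fit (with intercept) on X."""
    A = np.column_stack([X, np.ones(len(X))])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return float(np.mean((y - A @ coef) ** 2))


def greedy_select(X, y, n_select):
    """Pick n_select feature indices one at a time, each minimizing the loss."""
    selected = []
    remaining = list(range(X.shape[1]))
    for _ in range(n_select):
        # Try adding each remaining feature to the current selection.
        losses = {j: fit_loss(X[:, selected + [j]], y) for j in remaining}
        best = min(losses, key=losses.get)
        selected.append(best)
        remaining.remove(best)
    return selected  # order reflects descending importance


# Synthetic data: only features 2 and 7 influence the target.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = 3.0 * X[:, 2] - 2.0 * X[:, 7] + 0.01 * rng.normal(size=200)
print(greedy_select(X, y, 2))
```

Each step refits one model per remaining feature, so selecting $N$ of $P$ features costs on the order of $N \cdot P$ fits, which is why the approach is attractive when $N \ll P$.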

artificial intelligence, machine learning, predictor, (17 more...)

doi: 10.1109/IJCNN55064.2022.9892946

2207.0939

Country:

Europe > Sweden > Stockholm > Stockholm (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.35)

Lazy Estimation of Variable Importance for Large Neural Networks

Gao, Yue, Stevens, Abby, Willett, Rebecca, Raskutti, Garvesh

As opaque predictive models increasingly impact many areas of modern life, interest in quantifying the importance of a given input variable for making a specific prediction has grown. Recently, there has been a proliferation of model-agnostic methods to measure variable importance (VI) that analyze the difference in predictive power between a full model trained on all variables and a reduced model that excludes the variable(s) of interest. A bottleneck common to these methods is the estimation of the reduced model for each variable (or subset of variables), which is an expensive process that often does not come with theoretical guarantees. In this work, we propose a fast and flexible method for approximating the reduced model with important inferential guarantees. We replace the need for fully retraining a wide neural network by a linearization initialized at the full model parameters. By adding a ridge-like penalty to make the problem convex, we prove that when the ridge penalty parameter is sufficiently large, our method estimates the variable importance measure with an error rate of $O(\frac{1}{\sqrt{n}})$ where $n$ is the number of training samples. We also show that our estimator is asymptotically normal, enabling us to provide confidence bounds for the VI estimates. We demonstrate through simulations that our method is fast and accurate under several data-generating regimes, and we demonstrate its real-world applicability on a seasonal climate forecasting example.
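The retrain-based variable importance that this abstract describes as the bottleneck can be sketched as below: fit a full model, refit a reduced model without the variable of interest, and compare held-out loss. This shows the expensive baseline only, not the proposed lazy linearization, and it uses a least-squares model in place of a neural network; the function names and synthetic data are illustrative assumptions.

```python
# Sketch only: variable importance as (reduced-model loss - full-model loss),
# using least squares instead of a neural network. Names are hypothetical.
import numpy as np


def heldout_mse(X_train, y_train, X_test, y_test):
    """Fit least squares (with intercept) on train, return MSE on test."""
    A = np.column_stack([X_train, np.ones(len(X_train))])
    coef, *_ = np.linalg.lstsq(A, y_train, rcond=None)
    B = np.column_stack([X_test, np.ones(len(X_test))])
    return float(np.mean((y_test - B @ coef) ** 2))


def retrain_importance(X_train, y_train, X_test, y_test, j):
    """Increase in held-out MSE when variable j is dropped and the model refit."""
    full = heldout_mse(X_train, y_train, X_test, y_test)
    keep = [k for k in range(X_train.shape[1]) if k != j]
    reduced = heldout_mse(X_train[:, keep], y_train, X_test[:, keep], y_test)
    return reduced - full


# Synthetic data: only variable 0 matters.
rng = np.random.default_rng(1)
X = rng.normal(size=(1000, 3))
y = 2.0 * X[:, 0] + 0.1 * rng.normal(size=1000)
X_tr, X_te, y_tr, y_te = X[:500], X[500:], y[:500], y[500:]

for j in range(3):
    print(j, retrain_importance(X_tr, y_tr, X_te, y_te, j))
```

One refit per variable is cheap for a linear model but expensive for a large network, which is the cost the lazy estimation approach aims to avoid.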

neural network, shapley value, theorem 4, (17 more...)

2207.09097

Country:

North America > United States > Wisconsin > Dane County > Madison (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)
Pacific Ocean (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

Pliushch, Iuliia, Mundt, Martin, Lupp, Nicolas, Ramesh, Visvanathan

When Deep Classifiers Agree: Analyzing Correlations between Learning Order and Image Statistics

Although a plethora of architectural variants for deep classification has been introduced over time, recent works have found empirical evidence towards similarities in their training process. It has been hypothesized that neural networks converge not only to similar representations, but also exhibit a notion of empirical agreement on which data instances are learned first. Following in the latter works' footsteps, we define a metric to quantify the relationship between such classification agreement over time, and posit that the agreement phenomenon can be mapped to core statistics of the investigated dataset. We empirically corroborate this hypothesis across the CIFAR10, Pascal, ImageNet and KTH-TIPS2 datasets. Our findings indicate that agreement seems to be independent of specific architectures, training hyper-parameters or labels, albeit follows an ordering according to image statistics.

agreement, correlation, pearson, (15 more...)

2105.08997

Country:

Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)
Europe > Spain (0.04)
Europe > Germany > Hesse > Darmstadt Region > Frankfurt (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report > New Finding (0.66)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.93)
(2 more...)

AutoDES: AutoML Pipeline Generation of Classification with Dynamic Ensemble Strategy Selection

Zhao, Yunpu, Zhang, Rui, Li, Xiaqing

Automated machine learning has seen remarkable technological developments in recent years, and building an automated machine learning pipeline is now an essential task. The model ensemble is the technique of combining multiple models to get a better and more robust model. However, existing automated machine learning tends to be simplistic in handling the model ensemble, where the ensemble strategy is fixed, such as stacked generalization. Many techniques exist for different ensemble methods, especially ensemble selection, and a fixed ensemble strategy caps the model's achievable performance. In this article, we present a novel framework for automated machine learning. Our framework incorporates advances in dynamic ensemble selection, and to the best of our knowledge, our approach is the first in the field of AutoML to search and optimize ensemble strategies. In the comparison experiments, our method outperforms the state-of-the-art automated machine learning frameworks with the same CPU time on 42 classification datasets from the OpenML platform. Ablation experiments on our framework validate the effectiveness of our proposed method.

dataset, ensemble strategy, selection, (14 more...)

2201.00207

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Europe > Greece > Crete > Chania (0.04)
Asia > Taiwan (0.04)
Asia > China (0.04)

Genre:

Research Report (1.00)
Overview (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)