Accuracy
Federated Learning for Intrusion Detection in IoT Security: A Hybrid Ensemble Approach
Chatterjee, Sayan, Hanawal, Manjesh K.
Critical role of Internet of Things (IoT) in various domains like smart city, healthcare, supply chain and transportation has made them the target of malicious attacks. Past works in this area focused on centralized Intrusion Detection System (IDS), assuming the existence of a central entity to perform data analysis and identify threats. However, such IDS may not always be feasible, mainly due to spread of data across multiple sources and gathering at central node can be costly. Also, the earlier works primarily focused on improving True Positive Rate (TPR) and ignored the False Positive Rate (FPR), which is also essential to avoid unnecessary downtime of the systems. In this paper, we first present an architecture for IDS based on hybrid ensemble model, named PHEC, which gives improved performance compared to state-of-the-art architectures. We then adapt this model to a federated learning framework that performs local training and aggregates only the model parameters. Next, we propose Noise-Tolerant PHEC in centralized and federated settings to address the label-noise problem. The proposed idea uses classifiers using weighted convex surrogate loss functions. Natural robustness of KNN classifier towards noisy data is also used in the proposed architecture. Experimental results on four benchmark datasets drawn from various security attacks show that our model achieves high TPR while keeping FPR low on noisy and clean data. Further, they also demonstrate that the hybrid ensemble models achieve performance in federated settings close to that of the centralized settings.
Evaluating OCR Output Quality with Character Error Rate (CER) and Word Error Rate (WER)
The usual way of evaluating prediction output is with the accuracy metric, where we indicate a match (1) or a no match (0). However, this does not provide enough granularity to effectively assess OCR performance. We should instead use error rates to determine the extent to which the OCR transcribed text and ground truth text (i.e. A common intuition is to see how many characters were misspelled. While this is correct, the actual error rate calculation is more complex than that.
Unsupervised machine learning application in ACL
De-Sheng Chen,* Tong-Fu Wang,* Jia-Wang Zhu, Bo Zhu, Zeng-Liang Wang, Jian-Gang Cao, Cai-Hong Feng, Jun-Wei Zhao Department of Sports Medicine and Arthroscopy, Tianjin Hospital of Tianjin University, Tianjin, People's Republic of China *These authors contributed equally to this work Correspondence: Jia-Wang Zhu Department of Sports Medicine and Arthroscopy, Tianjin Hospital of Tianjin University, Tianjin, People's Republic of China Email [email protected] Purpose: We aim to present an unsupervised machine learning application in anterior cruciate ligament (ACL) rupture and evaluate whether supervised machine learning-derived radiomics features enable prediction of ACL rupture accurately. Patients and Methods: Sixty-eight patients were reviewed. Their demographic features were recorded, radiomics features were extracted, and the input dataset was defined as a collection of demographic features and radiomics features. The input dataset was automatically classified by the unsupervised machine learning algorithm. Then, we used a supervised machine learning algorithm to construct a radiomics model.
Interpreting Depression From Question-wise Long-term Video Recording of SDS Evaluation
Xie, Wanqing, Liang, Lizhong, Lu, Yao, Wang, Chen, Shen, Jihong, Luo, Hui, Liu, Xiaofeng
Self-Rating Depression Scale (SDS) questionnaire has frequently been used for efficient depression preliminary screening. However, the uncontrollable self-administered measure can be easily affected by insouciantly or deceptively answering, and producing the different results with the clinician-administered Hamilton Depression Rating Scale (HDRS) and the final diagnosis. Clinically, facial expression (FE) and actions play a vital role in clinician-administered evaluation, while FE and action are underexplored for self-administered evaluations. In this work, we collect a novel dataset of 200 subjects to evidence the validity of self-rating questionnaires with their corresponding question-wise video recording. To automatically interpret depression from the SDS evaluation and the paired video, we propose an end-to-end hierarchical framework for the long-term variable-length video, which is also conditioned on the questionnaire results and the answering time. Specifically, we resort to a hierarchical model which utilizes a 3D CNN for local temporal pattern exploration and a redundancy-aware self-attention (RAS) scheme for question-wise global feature aggregation. Targeting for the redundant long-term FE video processing, our RAS is able to effectively exploit the correlations of each video clip within a question set to emphasize the discriminative information and eliminate the redundancy based on feature pair-wise affinity. Then, the question-wise video feature is concatenated with the questionnaire scores for final depression detection. Our thorough evaluations also show the validity of fusing SDS evaluation and its video recording, and the superiority of our framework to the conventional state-of-the-art temporal modeling methods.
Smart Healthcare in the Age of AI: Recent Advances, Challenges, and Future Prospects
Nasr, Mahmoud, Islam, MD. Milon, Shehata, Shady, Karray, Fakhri, Quintana, Yuri
The significant increase in the number of individuals with chronic ailments (including the elderly and disabled) has dictated an urgent need for an innovative model for healthcare systems. The evolved model will be more personalized and less reliant on traditional brick-and-mortar healthcare institutions such as hospitals, nursing homes, and long-term healthcare centers. The smart healthcare system is a topic of recently growing interest and has become increasingly required due to major developments in modern technologies, especially in artificial intelligence (AI) and machine learning (ML). This paper is aimed to discuss the current state-of-the-art smart healthcare systems highlighting major areas like wearable and smartphone devices for health monitoring, machine learning for disease diagnosis, and the assistive frameworks, including social robots developed for the ambient assisted living environment. Additionally, the paper demonstrates software integration architectures that are very significant to create smart healthcare systems, integrating seamlessly the benefit of data analytics and other tools of AI. The explained developed systems focus on several facets: the contribution of each developed framework, the detailed working procedure, the performance as outcomes, and the comparative merits and limitations. The current research challenges with potential future directions are addressed to highlight the drawbacks of existing systems and the possible methods to introduce novel frameworks, respectively. This review aims at providing comprehensive insights into the recent developments of smart healthcare systems to equip experts to contribute to the field.
Stroke Prediction using Data Analytics and Machine Learning
Stroke is the second leading cause of death worldwide. According to the World Health Organization [1], 5 million people worldwide suffer a stroke every year. Of these, one third die and another third are left permanently disabled. In the United States, someone has a stroke every 40 seconds and every four minutes, someone dies [2]. The aftermath is devastating, with victims experiencing a wide range of disabling symptoms including sudden paralysis, speech loss or blindness due to blood flow interruption in the brain [3].
FISH-Net: Automating the Fish Doorbell
Many different fish species travel vast distances every year to reach their breeding grounds. Nowadays this journey is made more difficult due to obstacles such as water locks. One of these water locks placed along a popular route for migrating fish is the Weerdsluis in Utrecht. In order raise awareness, an initiative was launched together with the municipality of Utrecht: the Fish Doorbell. Here users could watch a livestream of the waters at the lock, and'ring' a bell if they spotted a fish.
Machine Learning Using Python Programming
'Machine Learning is all about how a machine with an artificial intelligence learns like a human being' Welcome to the course on Machine Learning and Implementing it using Python 3. As the title says, this course recommends to have a basic knowledge in Python 3 to grasp the implementation part easily but it is not compulsory. This course has strong content on the core concepts of ML such as it's features, the steps involved in building a ML Model – Data Preprocessing, Finetuning the Model, Overfitting, Underfitting, Bias, Variance, Confusion Matrix and performance measures of a ML Model. We'll understand the importance of many preprocessing techniques such as Binarization, MinMaxScaler, Standard Scaler We can implement many ML Algorithms in Python using scikit-learn library in a few lines. Can't we? Yet, that won't help us to understand the algorithms. Hence, in this course, we'll first look into understanding the mathematics and concepts behind the algorithms and then, we'll implement the same in Python.
Machine Learning using Python Programming
'Machine Learning is all about how a machine with an artificial intelligence learns like a human being' Welcome to the course on Machine Learning and Implementing it using Python 3. As the title says, this course recommends to have a basic knowledge in Python 3 to grasp the implementation part easily but it is not compulsory. This course has strong content on the core concepts of ML such as it's features, the steps involved in building a ML Model - Data Preprocessing, Finetuning the Model, Overfitting, Underfitting, Bias, Variance, Confusion Matrix and performance measures of a ML Model. We'll understand the importance of many preprocessing techniques such as Binarization, MinMaxScaler, Standard Scaler We can implement many ML Algorithms in Python using scikit-learn library in a few lines. Can't we? Yet, that won't help us to understand the algorithms. Hence, in this course, we'll first look into understanding the mathematics and concepts behind the algorithms and then, we'll implement the same in Python.
Multi-objective Asynchronous Successive Halving
Schmucker, Robin, Donini, Michele, Zafar, Muhammad Bilal, Salinas, David, Archambeau, Cédric
Hyperparameter optimization (HPO) is increasingly used to automatically tune the predictive performance (e.g., accuracy) of machine learning models. However, in a plethora of real-world applications, accuracy is only one of the multiple -- often conflicting -- performance criteria, necessitating the adoption of a multi-objective (MO) perspective. While the literature on MO optimization is rich, few prior studies have focused on HPO. In this paper, we propose algorithms that extend asynchronous successive halving (ASHA) to the MO setting. Considering multiple evaluation metrics, we assess the performance of these methods on three real world tasks: (i) Neural architecture search, (ii) algorithmic fairness and (iii) language model optimization. Our empirical analysis shows that MO ASHA enables to perform MO HPO at scale. Further, we observe that that taking the entire Pareto front into account for candidate selection consistently outperforms multi-fidelity HPO based on MO scalarization in terms of wall-clock time. Our algorithms (to be open-sourced) establish new baselines for future research in the area.