Accuracy
Out-of-domain Detection for Natural Language Understanding in Dialog Systems
Zheng, Yinhe, Chen, Guanyi, Huang, Minlie
In natural language understanding components, detecting out-of-domain (OOD) inputs is important for dialogue systems since wrongly accepting these OOD utterances that are not currently supported may lead to catastrophic failures of the entire system. Entropy regularization is an effective solution to avoid such failures, however, its computation heavily depends on OOD data, which are expensive to collect. In this paper, we propose a novel text generation model to produce high-quality OOD samples and thereby improve the performance of OOD detection. The proposed model can also utilize a set of unlabeled data to improve the effectiveness of these generated OOD samples. Experiments show that our method can effectively improve the OOD detection performance of a NLU module. 1 Introduction Natural Language Understanding (NLU) in dialog systems, particularly including task-oriented dialog systems and intelligent personal assistants, is vital for understanding users' inputs and making effective interactions. NLU maps text inputs to structured user intents, and decides the downstream processing pipelines of a dialog system, thereby becoming a precursor for the success of such systems. Recently, various deep neural network (DNN) based NLU models have been proposed and applied in real-world applications (Kim et al., 2018; Sarikaya, 2017; Y oo et al., 2018). Most existing DNN based NLU modules are built by following a closed-world assumption (Fei and Liu, 2016), i.e, the data used in the training and test phrase are drawn from the same distribution. However, such an assumption is commonly violated in practical systems that are deployed in a dynamic or open environment. Specifically, practical NLU systems often encounter o ut-o f-d omain (OOD) inputs that are not supported by the system and thus not observed in the training data.
Outlier Detection in High Dimensional Data
Most of the existing algorithms fail to properly address the issues stemming from a large number of features. In particular, outlier detection algorithms perform poorly on data set of small size with a large number of features. In this paper, we propose a novel outlier detection algorithm based on principal component analysis and kernel density estimation. The proposed method is designed to address the challenges of dealing with high-dimensional data by projecting the original data onto a smaller space and using the innate structure of the data to calculate anomaly scores for each data point. Numerical experiments on synthetic and real-life data show that our method performs well on high-dimensional data. Our method also produces better-than-average execution times compared to the benchmark methods. Despite the growing amount of data that has become available for research and discovery there remain areas where certain type of data is scarce. In the fields such as medical diagnostics, network intrusion detection, fraudulent financial transactions and many others, deviations from normal behavior, i.e. anomalies, are rare. However, often, these events are of the greatest importance. For example, it would be extremely beneficial to determine if a person has an illness based on abnormal lab results.
Aegis AI Software Detects Gun Threats And Provides Real-Time Alerts
During the Parkland, Florida, school shooting in 2018, the shooter was caught on a security camera pulling his rifle out of a duffle bag in the staircase 15 seconds before discharging the first round. However, the School Resource Officer didn't enter the building because he wasn't confident about the situation, and the Coral Springs Police Department had no idea what the shooter even looked like until 7 minutes and 30 seconds after the first round was fired. If the video system had included technology to recognize the gun threat in real time, alerts could have been sent to the security team. An announcement could have been made right away for all students and faculty in Building 12 to barricade their doors, and law enforcement could have responded a lot faster to a real-time feed of timely and accurate information. Aegis AI offers such a technology, which the company says enables existing security cameras to automatically recognize gun threats and notify security in real-time.
Feature-Set-Engineering for Detecting Freezing of Gait in Parkinson's Disease using Deep Recurrent Neural Networks
Masiala, Spyroula, Huijbers, Willem, Atzmueller, Martin
Freezing of gait (FoG) is a common gait disability in Parkinson's disease, that usually appears in its advanced stage. Freeze episodes are associated with falls, injuries, and psychological consequences, negatively affecting the patients' quality of life. For detecting FoG episodes automatically, a highly accurate detection method is necessary. This paper presents an approach for detecting FoG episodes utilizing a deep recurrent neural network (RNN) on 3D-accelerometer measurements. We investigate suitable features and feature combinations extracted from the sensors' time series data. Specifically, for detecting FoG episodes, we apply a deep RNN with Long Short-Term Memory cells. In our experiments, we perform both user dependent and user independent experiments, to detect freeze episodes. Our experimental results show that the frequency domain features extracted from the trunk sensor are the most informative feature group in the subject independent method, achieving an average AUC score of 93%, Specificity of 90% and Sensitivity of 81%. Moreover, frequency and statistical features of all the sensors are identified as the best single input for the subject dependent method, achieving an average AUC score of 97%, Specificity of 96% and Sensitivity of 87%. Overall, in a comparison to state-of-the-art approaches from literature as baseline methods, our proposed approach outperforms these significantly.
Differentially Private Precision Matrix Estimation
Su, Wenqing, Guo, Xiao, Zhang, Hai
In this paper, we study the problem of precision matrix estimation when the dataset contains sensitive information. In the differential privacy framework, we develop a differentially private ridge estimator by perturbing the sample covariance matrix. Then we develop a differentially private graphical lasso estimator by using the alternating direction method of multipliers (ADMM) algorithm. The theoretical results and empirical results that show the utility of the proposed methods are also provided. Keywords differential privacy, graphical model, ADMM algorithm 1 Introduction Precision matrix plays a fundamental role in many statistical inference problems. For example, in discriminant analysis, the precision matrix needs to be estimated to compute the classification rules[1]. In graphical models, the structure exploration of gaussian graphical model is equivalent to recover the support of the precision matrix[2]. Moreover, the precision matrix is useful for a wide range of applications including portfolio optimization, genomics and single processing, among many others. Therefore, it is of great importance to estimate the precision matrix.
Machine Learning Approaches for Detecting the Depression from Resting-State Electroencephalogram (EEG): A Review Study
Radenković, Milena Čukić, Lopez, Victoria Lopez
In this paper, we aimed at reviewing present literature on employing nonlinear analysis in combination with machine learning methods, in depression detection or prediction task. We are focusing on an affordable data-driven approach, applicable for everyday clinical practice, and in particular, those based on electroencephalographic (EEG) recordings. Among those studies utilizing EEG, we are discussing a group of applications used for detecting the depression based on the resting state EEG (detection studies) and interventional studies (using stimulus in their protocols or aiming to predict the outcome of therapy). We conclude with a discussion and review of guidelines to improve the reliability of developed models that could serve the improvement of diagnostic and more accurate treatment of depression.
Optimizing Generalized Rate Metrics through Game Equilibrium
Narasimhan, Harikrishna, Cotter, Andrew, Gupta, Maya
We present a general framework for solving a large class of learning problems with non-linear functions of classification rates. This includes problems where one wishes to optimize a non-decomposable performance metric such as the F-measure or G-mean, and constrained training problems where the classifier needs to satisfy non-linear rate constraints such as predictive parity fairness, distribution divergences or churn ratios. We extend previous two-player game approaches for constrained optimization to a game between three players to decouple the classifier rates from the non-linear objective, and seek to find an equilibrium of the game. Our approach generalizes many existing algorithms, and makes possible new algorithms with more flexibility and tighter handling of non-linear rate constraints. We provide convergence guarantees for convex functions of rates, and show how our methodology can be extended to handle sums of ratios of rates. Experiments on different fairness tasks confirm the efficacy of our approach.
Master your Metrics with Calibration
Siblini, Wissam, Fréry, Jordan, He-Guelton, Liyun, Oblé, Frédéric, Wang, Yi-Qing
Machine learning models deployed in real-world applications are often evaluated with precision-based metrics such as F1-score or AUC-PR (Area Under the Curve of Precision Recall). Heavily dependent on the class prior, such metrics may sometimes lead to wrong conclusions about the performance. For example, when dealing with non-stationary data streams, they do not allow the user to discern the reasons why a model performance varies across different periods. In this paper, we propose a way to calibrate the metrics so that they are no longer tied to the class prior. It corresponds to a readjustment, based on probabilities, to the value that the metric would have if the class prior was equal to a reference prior (user parameter). We conduct a large number of experiments on balanced and imbalanced data to assess the behavior of calibrated metrics and show that they improve interpretability and provide a better control over what is really measured. We describe specific real-world use-cases where calibration is beneficial such as, for instance, model monitoring in production, reporting, or fairness evaluation.
Student Performance Prediction with Optimum Multilabel Ensemble Model
Yekun, Ephrem Admasu, Teklay, Abrahaley
One of the important measures of quality of education is the performance of students in the academic settings. Nowadays, abundant data is stored in educational institutions about students which can help to discover insight on how students are learning and how to improve their performance ahead of time using data mining techniques. In this paper, we developed a student performance prediction model that predicts the performance of high school students for the next semester for five courses. We modeled our prediction system as a multi-label classification task and used support vector machine (SVM), Random Forest (RF), K-nearest Neighbors (KNN), and Mult-layer perceptron (MLP) as base-classifiers to train our model. We further improved the performance of the prediction model using state-of-the-art partitioning schemes to divide the label space into smaller spaces and use Label Powerset (LP) transformation method to transform each labelset into a multi-class classification task. The proposed model achieved better performance in terms of different evaluation metrics when compared to other multi-label learning tasks such as binary relevance and classifier chains.