Performance Analysis
Calibrating sufficiently
Binary classification, in the first place, deals with decision tools (classifiers) that facilitate the prediction of the classes of instances on the basis of the so-called features of the instances. Accordingly, the simplest classifiers are crisp (or discrete) in the sense of having the set {0, 1} as output range: 1 for'predict positive class', 0 for'predict negative class. Scoring (or soft) classifiers provide output in a continuous range, usually with the interpretation that high values indicate high likelihood of the instance belonging to the positive class, while low values suggest that membership of the negative class is more likely. In many applications of classification, there is a need for'calibrated' probabilistic classifiers which reflect the likelihood of the positive class given the features of an instance in a frequentist statistical sense (Platt, 2000; Zadrozny and Elkan, 2002; Cohen and Goldszmidt, 2004; Kull et al., 2017). How to best achieve good calibration and how to measure it are active research areas (Böken, 2021; Roelofs et al., 2020).
Improving Entropic Out-of-Distribution Detection using Isometric Distances and the Minimum Distance Score
Macêdo, David, Ludermir, Teresa
Current out-of-distribution detection approaches usually present special requirements (e.g., collecting outlier data and hyperparameter validation) and produce side effects (classification accuracy drop and slow/inefficient inferences). Recently, entropic out-of-distribution detection has been proposed as a seamless approach (i.e., a solution that avoids all the previously mentioned drawbacks). The entropic out-of-distribution detection solution comprises the IsoMax loss for training and the entropic score for out-of-distribution detection. The IsoMax loss works as a SoftMax loss drop-in replacement because swapping the SoftMax loss with the IsoMax loss requires no changes in the model's architecture or training procedures/hyperparameters. In this paper, we propose to perform what we call an isometrization of the distances used in the IsoMax loss. Additionally, we propose to replace the entropic score with the minimum distance score. Our experiments showed that these simple modifications increase out-of-distribution detection performance while keeping the solution seamless.
The Dark Machines Anomaly Score Challenge: Benchmark Data and Model Independent Event Classification for the Large Hadron Collider
Aarrestad, T., van Beekveld, M., Bona, M., Boveia, A., Caron, S., Davies, J., De Simone, A., Doglioni, C., Duarte, J. M., Farbin, A., Gupta, H., Hendriks, L., Heinrich, L., Howarth, J., Jawahar, P., Jueid, A., Lastow, J., Leinweber, A., Mamuzic, J., Merényi, E., Morandini, A., Moskvitina, P., Nellist, C., Ngadiuba, J., Ostdiek, B., Pierini, M., Ravina, B., de Austri, R. Ruiz, Sekmen, S., Touranakou, M., Vaškevičiūte, M., Vilalta, R., Vlimant, J. R., Verheyen, R., White, M., Wulff, E., Wallin, E., Wozniak, K. A., Zhang, Z.
We describe the outcome of a data challenge conducted as part of the Dark Machines Initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims at detecting signals of new physics at the LHC using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of >1 Billion simulated LHC events corresponding to $10~\rm{fb}^{-1}$ of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics searches during the third run of the LHC, and provide our benchmark dataset for future studies at https://www.phenoMLdata.org. Code to reproduce the analysis is provided at https://github.com/bostdiek/DarkMachines-UnsupervisedChallenge.
Robust Regularization with Adversarial Labelling of Perturbed Samples
Guo, Xiaohui, Zhang, Richong, Zheng, Yaowei, Mao, Yongyi
Recent researches have suggested that the predictive accuracy of neural network may contend with its adversarial robustness. This presents challenges in designing effective regularization schemes that also provide strong adversarial robustness. Revisiting Vicinal Risk Minimization (VRM) as a unifying regularization principle, we propose Adversarial Labelling of Perturbed Samples (ALPS) as a regularization scheme that aims at improving the generalization ability and adversarial robustness of the trained model. ALPS trains neural networks with synthetic samples formed by perturbing each authentic input sample towards another one along with an adversarially assigned label. The ALPS regularization objective is formulated as a min-max problem, in which the outer problem is minimizing an upper-bound of the VRM loss, and the inner problem is L$_1$-ball constrained adversarial labelling on perturbed sample. The analytic solution to the induced inner maximization problem is elegantly derived, which enables computational efficiency. Experiments on the SVHN, CIFAR-10, CIFAR-100 and Tiny-ImageNet datasets show that the ALPS has a state-of-the-art regularization performance while also serving as an effective adversarial training scheme.
Detecting Adversarial Examples with Bayesian Neural Network
Li, Yao, Tang, Tongyi, Hsieh, Cho-Jui, Lee, Thomas C. M.
In this paper, we propose a new framework to detect adversarial examples motivated by the observations that random components can improve the smoothness of predictors and make it easier to simulate output distribution of deep neural network. With these observations, we propose a novel Bayesian adversarial example detector, short for BATer, to improve the performance of adversarial example detection. In specific, we study the distributional difference of hidden layer output between natural and adversarial examples, and propose to use the randomness of Bayesian neural network (BNN) to simulate hidden layer output distribution and leverage the distribution dispersion to detect adversarial examples. The advantage of BNN is that the output is stochastic while neural networks without random components do not have such characteristics. Empirical results on several benchmark datasets against popular attacks show that the proposed BATer outperforms the state-of-the-art detectors in adversarial example detection.
California County Hopes Artificial Intelligence Can Mitigate Wildfire Risk
At this time of year, periodic rain showers on the north coast of California give way to months of daily sunshine and a wildfire risk that grows in severity until the next fall rains arrive. In Sonoma County, a new set of eyes is watching over the forest. Those eyes will be able to tap into an artificial intelligence program to make sure emergency dispatchers are alerted to actual fires instead of mist rising off the forest floor or steam from the region's numerous natural geysers. The county has entered into a $300,000 contract with South Korea technology firm Alchera to provide artificial intelligence software that can alert fire dispatchers to the precise location of flames or smoke. The two-year pilot project is funded through $3 million in hazard mitigation grants that the Federal Emergency Management Agency awarded to the county.
Characterizing the SLOPE Trade-off: A Variational Perspective and the Donoho-Tanner Limit
Bu, Zhiqi, Klusowski, Jason, Rush, Cynthia, Su, Weijie J.
Sorted l1 regularization has been incorporated into many methods for solving high-dimensional statistical estimation problems, including the SLOPE estimator in linear regression. In this paper, we study how this relatively new regularization technique improves variable selection by characterizing the optimal SLOPE trade-off between the false discovery proportion (FDP) and true positive proportion (TPP) or, equivalently, between measures of type I error and power. Assuming a regime of linear sparsity and working under Gaussian random designs, we obtain an upper bound on the optimal trade-off for SLOPE, showing its capability of breaking the Donoho-Tanner power limit. To put it into perspective, this limit is the highest possible power that the Lasso, which is perhaps the most popular l1-based method, can achieve even with arbitrarily strong effect sizes. Next, we derive a tight lower bound that delineates the fundamental limit of sorted l1 regularization in optimally trading the FDP off for the TPP. Finally, we show that on any problem instance, SLOPE with a certain regularization sequence outperforms the Lasso, in the sense of having a smaller FDP, larger TPP and smaller l2 estimation risk simultaneously. Our proofs are based on a novel technique that reduces a variational calculus problem to a class of infinite-dimensional convex optimization problems and a very recent result from approximate message passing theory.
Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures
Park, Sangwook, Han, David K., Elhilali, Mounya
Sound event detection is an important facet of audio tagging that aims to identify sounds of interest and define both the sound category and time boundaries for each sound event in a continuous recording. With advances in deep neural networks, there has been tremendous improvement in the performance of sound event detection systems, although at the expense of costly data collection and labeling efforts. In fact, current state-of-the-art methods employ supervised training methods that leverage large amounts of data samples and corresponding labels in order to facilitate identification of sound category and time stamps of events. As an alternative, the current study proposes a semi-supervised method for generating pseudo-labels from unsupervised data using a student-teacher scheme that balances self-training and cross-training. Additionally, this paper explores post-processing which extracts sound intervals from network prediction, for further improvement in sound event detection performance. The proposed approach is evaluated on sound event detection task for the DCASE2020 challenge. The results of these methods on both "validation" and "public evaluation" sets of DESED database show significant improvement compared to the state-of-the art systems in semi-supervised learning.
MTH-IDS: A Multi-Tiered Hybrid Intrusion Detection System for Internet of Vehicles
Yang, Li, Moubayed, Abdallah, Shami, Abdallah
Modern vehicles, including connected vehicles and autonomous vehicles, nowadays involve many electronic control units connected through intra-vehicle networks to implement various functionalities and perform actions. Modern vehicles are also connected to external networks through vehicle-to-everything technologies, enabling their communications with other vehicles, infrastructures, and smart devices. However, the improving functionality and connectivity of modern vehicles also increase their vulnerabilities to cyber-attacks targeting both intra-vehicle and external networks due to the large attack surfaces. To secure vehicular networks, many researchers have focused on developing intrusion detection systems (IDSs) that capitalize on machine learning methods to detect malicious cyber-attacks. In this paper, the vulnerabilities of intra-vehicle and external networks are discussed, and a multi-tiered hybrid IDS that incorporates a signature-based IDS and an anomaly-based IDS is proposed to detect both known and unknown attacks on vehicular networks. Experimental results illustrate that the proposed system can detect various types of known attacks with 99.99% accuracy on the CAN-intrusion-dataset representing the intra-vehicle network data and 99.88% accuracy on the CICIDS2017 dataset illustrating the external vehicular network data. For the zero-day attack detection, the proposed system achieves high F1-scores of 0.963 and 0.800 on the above two datasets, respectively. The average processing time of each data packet on a vehicle-level machine is less than 0.6 ms, which shows the feasibility of implementing the proposed system in real-time vehicle systems. This emphasizes the effectiveness and efficiency of the proposed IDS.
Reputation Bootstrapping for Composite Services using CP-nets
Mistry, Sajib, Bouguettaya, Athman
We propose a novel framework to bootstrap the reputation of on-demand service compositions. On-demand compositions are usually context-aware and have little or no direct consumer feedback. The reputation bootstrapping of single or atomic services does not consider the topology of the composition and relationships among reputation-related factors. We apply Conditional Preference Networks (CP-nets) of reputation-related factors for component services in a composition. The reputation of a composite service is bootstrapped by the composition of CP-nets. We consider the history of invocation among component services to determine reputation-interdependence in a composition. The composition rules are constructed using the composition topology and four types of reputation-influence among component services. A heuristic-based Q-learning approach is proposed to select the optimal set of reputation-related CP-nets. Experimental results prove the efficiency of the proposed approach.