Support Vector Machines
A new asymmetric $\epsilon$-insensitive pinball loss function based support vector quantile regression model
Anand, Pritam, Rastogi, Reshma, Chandra, Suresh
In this paper, we propose a novel asymmetric $\epsilon$-insensitive pinball loss function for quantile estimation. There exists some pinball loss functions which attempt to incorporate the $\epsilon$-insensitive zone approach in it but, they fail to extend the $\epsilon$-insensitive approach for quantile estimation in true sense. The proposed asymmetric $\epsilon$-insensitive pinball loss function can make an asymmetric $\epsilon$- insensitive zone of fixed width around the data and divide it using $\tau$ value for the estimation of the $\tau$th quantile. The use of the proposed asymmetric $\epsilon$-insensitive pinball loss function in Support Vector Quantile Regression (SVQR) model improves its prediction ability significantly. It also brings the sparsity back in SVQR model. Further, the numerical results obtained by several experiments carried on artificial and real world datasets empirically show the efficacy of the proposed `$\epsilon$-Support Vector Quantile Regression' ($\epsilon$-SVQR) model over other existing SVQR models.
Transfer Learning-Based Label Proportions Method with Data of Uncertainty
Xiao, Yanshan, Wang, HuaiPei, Liu, Bo
Learning with label proportions(LLP), which seeks an instance-level classifier merely based on bag-level label proportions, is a new paradigm in machine learning that addresses the classification of instances [1, 2, 3]. In LLP, we only know the proportions of examples belonging to different classes in each bag; however the labels of the instances are unknown. From the binary classification perspective, the task of LLP is to learn a classifier to classify the unknown label instance as either positive class or negative class. The formulation that learning with label proportions has been first proposed by Kuck et al. in [1], which can be used for political elections analysis. In the case of politician polls, each candidate may have a group of loyal voters and some swing voters. They may know the vague proportion of votes cast in each district; however, they usually do not know the vote of each person. Since the candidates have limited resources, they have to analyze political elections and consider which kind of voters they should focus on so as to maximize their interests. To date, LLP has been applied to forecasting revenue [4], image classification [5, 6], video event detection [7], demographics mining [8] and privacy protection [9]. Figure 1 illustrates the binary classification problem in LLP.
AI Predicts Independent Construction Safety Outcomes from Universal Attributes
Baker, Henrietta, Hallowell, Matthew R., Tixier, Antoine J. -P.
These pro-3 grams rely on patterns and inference, rather than explicit instructions, to achieve their aims [5]. ML in construction has been developed significantly since 1991 when [6] first discussed the potential of neural networks in construction engineering and management. Early examples of ML in construction include applications such as [7] where the AQ15 algorithm was applied to automatically learn the mapping between constructability (poor, good, excellent) and 7 predictors from a collection of 31 training examples; and [8] who applied decision trees and neural networks to a construction management database to identify the causes of delays. Many subsequent prediction applications applied support vector machines (SVMs), owing to their consistently high accuracy. These applications include [9], who accurately forecasted contractor prequalification using input variables such as financial strength and current workload; [10], who estimated building cost and loss risk from ten input variables; and [11], who detected concrete structural components in color images from actual construction sites. In the last 5 years, use of ML in construction has become far more widespread and the methods and applications used are far more diverse.
Combining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets
Spjuth, Ola, Brรคnnstrรถm, Robin Carriรณn, Carlsson, Lars, Gauraha, Niharika
Department of Pharmaceutical Biosciences Uppsala University, Uppsala, Sweden Abstract Conformal Prediction is a framework that produces prediction intervals based on the output from a machine learning algorithm. In this paper we explore the case when training data is made up of multiple parts available in different sources that cannot be pooled. We here consider the regression case and propose a method where a conformal predictor is trained on each data source independently, and where the prediction intervals are then combined into a single interval. We call the approach Non-Disclosed Conformal Prediction (NDCP), and we evaluate it on a regression dataset from the UCI machine learning repository using support vector regression as the underlying machine learning algorithm, with varying number of data sources and sizes. The results show that the proposed method produces conservatively valid prediction intervals, and while we cannot retain the same efficiency as when all data is used, efficiency is improved through the proposed approach as compared to predicting using a single arbitrarily chosen source.
Double-Coupling Learning for Multi-Task Data Stream Classification
Shi, Yingzhong, Deng, Zhaohong, Chen, Haoran, Choi, Kup-Sze, Wang, Shitong
Data stream classification methods demonstrate promising performance on a single data stream by exploring the cohesion in the data stream. However, multiple data streams that involve several correlated data streams are common in many practical scenarios, which can be viewed as multi-task data streams. Instead of handling them separately, it is beneficial to consider the correlations among the multi-task data streams for data stream modeling tasks. In this regard, a novel classification method called double-coupling support vector machines (DC-SVM), is proposed for classifying them simultaneously. DC-SVM considers the external correlations between multiple data streams, while handling the internal relationship within the individual data stream. Experimental results on artificial and real-world multi-task data streams demonstrate that the proposed method outperforms traditional data stream classification methods.
Resonant Machine Learning Based on Complex Growth Transform Dynamical Systems
Chatterjee, Oindrila, Chakrabartty, Shantanu
In this paper we propose an energy-efficient learning framework which exploits structural and functional similarities between a machine learning network and a general electrical network satisfying the Tellegen's theorem. The proposed formulation ensures that the network's active-power is dissipated only during the process of learning, whereas the network's reactive-power is maintained to be zero at all times. As a result, in steady-state, the learned parameters are stored and self-sustained by electrical resonance determined by the network's nodal inductances and capacitances. Based on this approach, this paper introduces three novel concepts: (a) A learning framework where the network's active-power dissipation is used as a regularization for a learning objective function that is subjected to zero total reactive-power constraint; (b) A dynamical system based on complex-domain, continuous-time growth transforms which optimizes the learning objective function and drives the network towards electrical resonance under steady-state operation; and (c) An annealing procedure that controls the trade-off between active-power dissipation and the speed of convergence. As a representative example, we show how the proposed framework can be used for designing resonant support vector machines (SVMs), where we show that the support-vectors correspond to an LC network with self-sustained oscillations. We also show that this resonant network dissipates less active-power compared to its non-resonant counterpart.
Skin Lesion Segmentation and Classification for ISIC 2018 by Combining Deep CNN and Handcrafted Features
Ali, Redha, Hardie, Russell C., De Silva, Manawaduge Supun, Kebede, Temesguen Messay
This short report describes our submission to the ISIC 2018 Challenge in Skin Lesion Analysis Towards Melanoma Detection for Task1 and Task 3. This work has been accomplished by a team of researchers at the University of Dayton Signal and Image Processing Lab. Our proposed approach is computationally efficient are combines information from both deep learning and handcrafted features. For Task3, we form a new type of image features, called hybrid features, which has stronger discrimination ability than single method features. These features are utilized as inputs to a decision-making model that is based on a multiclass Support Vector Machine (SVM) classifier. The proposed technique is evaluated on online validation databases. Our score was 0.841 with SVM classifier on the validation dataset.
Comparison theorems on large-margin learning
Classification is a very important research topic in statist ical machine learning. There are a large amount of literature on various classification methods, ran ging from the very classical distribution-based likelihood approaches such as Fisher linear discriminant analysis (LDA) and logistic regression [3], to the margin-based approaches such as the well-known s upport vector machine (SVM) [1, 2]. Each type of classifiers has their own merits. Recently, Liu a nd his coauthors proposed in [4] the so-called large-margin unified machines (LUMs) which es tablish a unique transition between these two types of classifiers. As noted in [5], SVM may suffer fr om data piling problems in the high-dimension low-sample size (HDLSS) settings, that is, the support vectors will pile up on top of each other at the margin boundaries when projected onto th e normal vector of the separating hyperplane.
Flood Prediction Using Machine Learning Models: Literature Review
Mosavi, Amir, Ozturk, Pinar, Chau, Kwok-wing
Floods are among the most destructive natural disasters, which are highly complex to model. The research on the advancement of flood prediction models contributed to risk reduction, policy suggestion, minimization of the loss of human life, and reduction the property damage associated with floods. To mimic the complex mathematical expressions of physical processes of floods, during the past two decades, machine learning (ML) methods contributed highly in the advancement of prediction systems providing better performance and cost-effective solutions. Due to the vast benefits and potential of ML, its popularity dramatically increased among hydrologists. Researchers through introducing novel ML methods and hybridizing of the existing ones aim at discovering more accurate and efficient prediction models. The main contribution of this paper is to demonstrate the state of the art of ML models in flood prediction and to give insight into the most suitable models. In this paper, the literature where ML models were benchmarked through a qualitative analysis of robustness, accuracy, effectiveness, and speed are particularly investigated to provide an extensive overview on the various ML algorithms used in the field. The performance comparison of ML models presents an in-depth understanding of the different techniques within the framework of a comprehensive evaluation and discussion. As a result, this paper introduces the most promising prediction methods for both long-term and short-term floods. Furthermore, the major trends in improving the quality of the flood prediction models are investigated. Among them, hybridization, data decomposition, algorithm ensemble, and model optimization are reported as the most effective strategies for the improvement of ML methods.
Quantum-enhanced least-square support vector machine: simplified quantum algorithm and sparse solutions
Lin, Jie, Zhang, Dan-Bo, Zhang, Shuo, Wang, Xiang, Li, Tan, Bao, Wan-su
Quantum algorithms can enhance machine learning in different aspects. Here, we study quantum-enhanced least-square support vector machine (LS-SVM). Firstly, a novel quantum algorithm that uses continuous variable to assist matrix inversion is introduced to simplify the algorithm for quantum LS-SVM, while retaining exponential speed-up. Secondly, we propose a hybrid quantum-classical version for sparse solutions of LS-SVM. By encoding a large dataset into a quantum state, a much smaller transformed dataset can be extracted using quantum matrix toolbox, which is further processed in classical SVM. We also incorporate kernel methods into the above quantum algorithms, which uses both exponential growth Hilbert space of qubits and infinite dimensionality of continuous variable for quantum feature maps. The quantum LS-SVM exploits quantum properties to explore important themes for SVM such as sparsity and kernel methods, and stresses its quantum advantages ranging from speed-up to the potential capacity to solve classically difficult machine learning tasks.