AITopics | Support Vector Machines

Support Vector Machines

Support vector machines (SVMs, also support vector networks) are supervised learning models with associated learning algorithms that analyze data for classification and regression analysis. (Wikipedia)

Multi-level Training and Bayesian Optimization for Economical Hyperparameter Optimization

Yang, Yang, Deng, Ke, Zhu, Michael

arXiv.org Machine Learning, Jul-20-2020

Hyperparameters play a critical role in the performance of many machine learning methods. Determining their best settings, known as Hyperparameter Optimization (HPO), is made difficult by the large number of hyperparameters as well as the excessive training time. In this paper, we develop an effective approach to reducing the total amount of required training time for HPO. In the initialization, the nested Latin hypercube design is used to select hyperparameter configurations for two types of training: heavy training and light training. We propose a truncated additive Gaussian process model to calibrate approximate performance measurements generated by light training, using accurate performance measurements generated by heavy training. Based on the model, a sequential model-based algorithm is developed to generate the performance profile of the configuration space as well as find optimal configurations. Our proposed approach demonstrates competitive performance when applied to optimize synthetic examples, support vector machines, fully connected networks and convolutional neural networks.
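
As a rough illustration of the initialization idea, the sketch below draws a plain (not nested) Latin hypercube design and maps it to SVM hyperparameters. The function names, the hyperparameter names C and gamma, and their log-scale ranges are assumptions made for this example; the paper's nested design and Gaussian process model are not reproduced here.

```python
import random


def latin_hypercube(n_points, n_dims, rng):
    """Return n_points samples in [0, 1)^n_dims with one sample per
    stratum along every dimension (plain Latin hypercube sampling)."""
    columns = []
    for _ in range(n_dims):
        strata = list(range(n_points))
        rng.shuffle(strata)  # random pairing of strata across dimensions
        columns.append([(s + rng.random()) / n_points for s in strata])
    # Transpose the per-dimension columns into a list of points.
    return [tuple(col[i] for col in columns) for i in range(n_points)]


def to_log_range(u, low, high):
    """Map u in [0, 1) onto [low, high) on a logarithmic scale."""
    return low * (high / low) ** u


rng = random.Random(0)  # fixed seed so the design is reproducible
design = latin_hypercube(8, 2, rng)

# Hypothetical search ranges for an RBF-kernel SVM.
configs = [
    {"C": to_log_range(u, 1e-2, 1e2), "gamma": to_log_range(v, 1e-4, 1e0)}
    for u, v in design
]

for config in configs:
    print(config)
```

Each of the eight configurations falls in a different eighth of the range for both hyperparameters, which is the space-filling property that motivates using such a design to pick initial configurations.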

artificial intelligence, configuration, machine learning, (18 more...)

arXiv.org Machine Learning

2007.09953

Country:

Asia > China (0.04)
North America > United States > New York (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Deep Neural-Kernel Machines

Mehrkanoon, Siamak

arXiv.org Machine Learning, Jul-19-2020

In this chapter we review the main literature related to the recent advancement of the deep neural-kernel architecture, an approach that seeks the synergy between two powerful classes of models, i.e. kernel-based models and artificial neural networks. The introduced deep neural-kernel framework is composed of a hybridization of the neural network architecture and a kernel machine. More precisely, for the kernel counterpart the model is based on Least Squares Support Vector Machines with explicit feature mapping. Here we discuss the use of one form of an explicit feature map obtained by random Fourier features. Thanks to this explicit feature map, on the one hand bridging the two architectures becomes more straightforward, and on the other hand one can find the solution of the associated optimization problem in the primal, making the model scalable to large-scale datasets. We begin by introducing a neural-kernel architecture that serves as the core module for deeper models equipped with different pooling layers. In particular, we review three neural-kernel machines with average, maxout and convolutional pooling layers. In the average pooling layer, the outputs of the previous representation layers are averaged. The maxout layer triggers competition among different input representations and allows the formation of multiple sub-networks within the same model. The convolutional pooling layer reduces the dimensionality of the multi-scale output representations. Comparisons with the neural-kernel model, kernel-based models and classical neural network architectures have been made, and the numerical experiments illustrate the effectiveness of the introduced models on several benchmark datasets.
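
To make the explicit feature map concrete, here is a minimal sketch of random Fourier features for the RBF kernel exp(-gamma * ||x - y||^2), written with the standard library only. The function names, the value of gamma and the number of features are assumptions for this illustration; the chapter's neural-kernel layers and the LS-SVM training step are not shown.

```python
import math
import random


def make_rff_map(n_inputs, n_features, gamma, rng):
    """Build an explicit feature map phi such that phi(x) . phi(y)
    approximates the RBF kernel exp(-gamma * ||x - y||^2)."""
    # Frequencies are drawn from the kernel's Fourier transform:
    # a Gaussian with variance 2 * gamma in each input dimension.
    sigma = math.sqrt(2.0 * gamma)
    weights = [[rng.gauss(0.0, sigma) for _ in range(n_inputs)]
               for _ in range(n_features)]
    offsets = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_features)]
    scale = math.sqrt(2.0 / n_features)

    def feature_map(x):
        return [
            scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
            for w, b in zip(weights, offsets)
        ]

    return feature_map


def rbf_kernel(x, y, gamma):
    """Exact RBF kernel value, used here only as a reference."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))


rng = random.Random(0)
gamma = 0.5
phi = make_rff_map(n_inputs=2, n_features=4000, gamma=gamma, rng=rng)

x, y = (0.3, -0.2), (0.8, 0.1)
approx = sum(a * b for a, b in zip(phi(x), phi(y)))
exact = rbf_kernel(x, y, gamma)
print(f"approximate kernel: {approx:.4f}, exact kernel: {exact:.4f}")
```

Because phi(x) is an ordinary finite-dimensional vector, a linear model on top of it has one weight per random feature, which is what allows the optimization problem to be solved in the primal regardless of the number of training samples.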

artificial intelligence, machine learning, neural-kernel network, (16 more...)

arXiv.org Machine Learning

2007.06655

Country:

North America > Canada > Ontario > Toronto (0.14)
Oceania > Australia (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(5 more...)

Genre: Overview (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.69)

Every Machine Learning Algorithm Can Be Represented as a Neural Network

#artificialintelligence, Jul-18-2020, 04:05:21 GMT

It seems that all of the work in machine learning -- starting from early research in the 1950s -- culminated in the creation of the neural network. Successively, algorithm after new algorithm was proposed, from logistic regression to support vector machines, but the neural network is, very literally, the algorithm of algorithms and the pinnacle of machine learning. It's a universal generalization of what machine learning is, instead of one attempt at doing it. In this sense, it is more of a framework and a concept than simply an algorithm, and this is evident given the massive amount of freedom in constructing neural networks -- hidden layer and node counts, activation functions, optimizers, loss functions, network types (convolutional, recurrent, etc.), and specialized layers (batch norm, dropout, etc.), to name a few. From this perspective of neural networks being a concept rather than a rigid algorithm comes a very interesting corollary: any machine learning algorithm, be it decision trees or k-nearest neighbors, can be represented using a neural network.

artificial intelligence, machine learning, neural network, (15 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.50)

Training with reduced precision of a support vector machine model for text classification

Żurek, Dominik, Pietroń, Marcin

arXiv.org Machine Learning, Jul-17-2020

This paper presents the impact of using quantization on the efficiency of multi-class text classification in the training process of a support vector machine (SVM). This work focuses on comparing the efficiency of an SVM model trained using reduced precision with its original form. The main advantage of using quantization is a decrease in computation time and in memory footprint on dedicated hardware platforms that support low-precision computation, such as GPUs (16-bit) or FPGAs (any bit-width). The paper presents the impact of a precision reduction of the SVM training process on text classification accuracy. The CPU implementation was performed using the OpenMP library. Additionally, the results of the GPU implementation using double, single and half precision are presented.
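
The effect of reduced precision on a linear decision value can be sketched with the standard library, which can round a double to IEEE 754 half precision through the `struct` format code `e`. The weights, bias and sample below are made-up numbers, and only the stored values are rounded (the sum is still accumulated in double precision), so this is an illustration of the idea rather than the paper's training procedure.

```python
import struct


def to_half(value):
    """Round a Python float (double precision) to half precision and back."""
    return struct.unpack("<e", struct.pack("<e", value))[0]


def decision_value(weights, bias, features):
    """Linear SVM decision value w . x + b."""
    return sum(w * f for w, f in zip(weights, features)) + bias


# Hypothetical model parameters and input, chosen only for illustration.
weights = [0.7312, -1.2045, 0.0531]
bias = -0.1
sample = [1.0, 0.5, 2.0]

full = decision_value(weights, bias, sample)
half = decision_value(
    [to_half(w) for w in weights],
    to_half(bias),
    [to_half(f) for f in sample],
)

print(f"double precision: {full!r}")
print(f"half precision:   {half!r}")
print(f"absolute difference: {abs(full - half):.2e}")
```

The two values differ slightly but have the same sign here, so the predicted class is unchanged; a sample lying very close to the decision boundary could flip, which is the kind of accuracy effect a precision study has to measure.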

artificial intelligence, machine learning, training process, (16 more...)

arXiv.org Machine Learning

2007.08657

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > Poland > Lesser Poland Province > Kraków (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Fundamentals of Machine Learning [Hindi][Python]

#artificialintelligence, Jul-16-2020, 09:29:11 GMT

Online course on Udemy: Fundamentals of Machine Learning [Hindi][Python]. A complete hands-on Machine Learning course with Data Science, NLP, Deep Learning and Artificial Intelligence. Created by Rishi Bansal (English). Description: This course is designed to teach the basic concepts of Machine Learning. Anyone can take this course. No prior understanding of Machine Learning is required. NOTE: The course is still under development, and new topics are added regularly. Now the question is: why this course?

artificial intelligence, deep learning, machine learning, (13 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.62)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.59)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Large scale analysis of generalization error in learning using margin based classification methods

Huang, Hanwen, Yang, Qinglong

arXiv.org Machine Learning, Jul-16-2020

Large-margin classifiers are popular methods for classification. We derive the asymptotic expression for the generalization error of a family of large-margin classifiers in the limit of both sample size $n$ and dimension $p$ going to $\infty$ with fixed ratio $\alpha=n/p$. This family covers a broad range of commonly used classifiers including support vector machines, distance weighted discrimination, and penalized logistic regression. Our result can be used to establish the phase transition boundary for the separability of two classes. We assume that the data are generated from a single multivariate Gaussian distribution with arbitrary covariance structure. We explore two special choices for the covariance matrix: the spiked population model and two-layer neural networks with random first-layer weights. The method we use to derive the closed-form expression is the replica method from statistical physics. Our asymptotic results match simulations already when $n,p$ are of the order of a few hundred. For two-layer neural networks, we reproduce the recently developed `double descent' phenomenology for several classification models. We also discuss some statistical insights that can be drawn from these analyses.

classification method, generalization error, regression, (11 more...)

arXiv.org Machine Learning

2007.10112

Country:

North America > United States > Georgia > Clarke County > Athens (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > China > Hubei Province > Wuhan (0.04)

Genre: Research Report > New Finding (0.87)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.35)

Radial basis function kernel optimization for Support Vector Machine classifiers

Thurnhofer-Hemsi, Karl, López-Rubio, Ezequiel, Molina-Cabello, Miguel A., Najarian, Kayvan

arXiv.org Machine Learning, Jul-16-2020

Since the inception of SVMs [1], interest in this kind of supervised learning method has only grown over the years [2], so that it has become a well-established tool for both classification and regression [3]. SVMs are regarded as the most prominent exemplar of kernel methods, which solve complex machine learning problems by using linear estimation methods on a high-dimensional feature space [4]. They are intensively employed in a myriad of applications, including object segmentation [5], video surveillance [6], drug discovery [7], and cancer genomics [8]. The SVM framework models a classification problem as a maximum margin optimization problem, which searches for the decision boundary that separates the training points of different classes with the largest distance (margin). There is a primal form of the optimization problem, where the weights to be optimized are associated with the input features, i.e., there is one weight per input feature. There is also a dual form, where the weights are associated with the training samples, i.e., one weight per training sample. In the dual form, the weights are Lagrange multipliers of a suitable Lagrangian function. The fewer variables to be optimized, the easier the optimization problem, so dual formulations are preferred for classification tasks with many input features [9]. This work has been submitted to the IEEE for possible publication.
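
For reference, the two forms described above can be written out for the standard soft-margin SVM. This is the usual textbook formulation, given here as an illustration; the symbols ($n$ training samples $x_i \in \mathbb{R}^p$ with labels $y_i \in \{-1, +1\}$, penalty $C$, slacks $\xi_i$, kernel $K$) are assumptions of this sketch and may differ from the notation used in the paper.

```latex
% Primal form (linear case): one weight w_j per input feature, j = 1, ..., p
\min_{w \in \mathbb{R}^p,\; b,\; \xi} \quad
  \frac{1}{2}\,\lVert w \rVert^2 + C \sum_{i=1}^{n} \xi_i
\qquad \text{s.t.} \quad
  y_i \left( w^\top x_i + b \right) \ge 1 - \xi_i, \quad
  \xi_i \ge 0, \quad i = 1, \dots, n

% Dual form: one Lagrange multiplier \alpha_i per training sample, i = 1, ..., n
\max_{\alpha \in \mathbb{R}^n} \quad
  \sum_{i=1}^{n} \alpha_i
  - \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n}
      \alpha_i \alpha_j \, y_i y_j \, K(x_i, x_j)
\qquad \text{s.t.} \quad
  0 \le \alpha_i \le C, \quad
  \sum_{i=1}^{n} \alpha_i y_i = 0
```

In the linear case $K(x_i, x_j) = x_i^\top x_j$ and the primal weights are recovered as $w = \sum_i \alpha_i y_i x_i$. The primal has on the order of $p$ variables and the dual has $n$, which is why the dual becomes attractive when the number of input features is large relative to the number of samples.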

artificial intelligence, configuration, machine learning, (17 more...)

arXiv.org Machine Learning

2007.08233

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > Wisconsin (0.04)
Europe > Spain > Andalusia > Málaga Province > Málaga (0.04)
(4 more...)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.68)
Health & Medicine > Therapeutic Area > Oncology (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Prediction of Cancer Microarray and DNA Methylation Data using Non-negative Matrix Factorization

Patel, Parth, Passi, Kalpdrum, Jain, Chakresh Kumar

arXiv.org Machine Learning, Jul-15-2020

Over the past few years, there has been a considerable spread of microarray technology in the study of many biological patterns, particularly those pertaining to cancers such as leukemia, prostate cancer and colon cancer. The primary bottleneck in properly understanding such datasets lies in their dimensionality, and thus, to study them efficiently and effectively, a substantial reduction in their dimension is deemed necessary. This study suggests different algorithms and approaches for reducing the dimensionality of such microarray datasets. It exploits the matrix-like structure of microarray data and uses a popular technique called Non-Negative Matrix Factorization (NMF) to reduce the dimensionality, primarily in the field of biological data. Classification accuracies are then compared for these algorithms. This technique gives an accuracy of 98%.

artificial intelligence, bioinformatics, machine learning, (13 more...)

arXiv.org Machine Learning

doi: 10.5121/csit.2020.100906

2007.08652

Country:

Asia > India > NCT > Delhi (0.04)
Asia > India > Gujarat (0.04)
North America > Canada > Ontario > Thunder Bay District > Sudbury (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Leukemia (0.51)

Technology:

Information Technology > Biomedical Informatics > Translational Bioinformatics (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.47)

Support Vector Machines explained with Python examples

#artificialintelligence, Jul-14-2020, 21:45:52 GMT

Support vector machines (SVMs) are a supervised machine learning technique. Even though they are mostly used for classification, they can also be applied to regression problems. SVMs define a decision boundary along with a maximal margin that separates almost all the points into two classes. Support vector machines are an improvement over maximal margin algorithms. Their biggest advantage is that they can define either a linear or a non-linear decision boundary by using kernel functions.
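
The linear case can be illustrated with a small self-contained sketch that fits a soft-margin linear SVM by full-batch subgradient descent on the regularized hinge loss. The toy data, the function names and the settings `lam`, `lr` and `epochs` are assumptions for this example; it is not the code from the article, and non-linear boundaries would additionally require a kernel function.

```python
def train_linear_svm(points, labels, lam=0.01, lr=0.1, epochs=200):
    """Minimize lam/2 * ||w||^2 + mean(max(0, 1 - y * (w . x + b)))
    with full-batch subgradient descent. Labels must be +1 or -1."""
    n_samples = len(points)
    n_features = len(points[0])
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        grad_w = [lam * w_j for w_j in w]  # gradient of the regularizer
        grad_b = 0.0
        for x, y in zip(points, labels):
            margin = y * (sum(w_j * x_j for w_j, x_j in zip(w, x)) + b)
            if margin < 1.0:  # only points inside the margin contribute
                for j in range(n_features):
                    grad_w[j] -= y * x[j] / n_samples
                grad_b -= y / n_samples
        w = [w_j - lr * g_j for w_j, g_j in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def predict(w, b, x):
    """Return +1 or -1 depending on the side of the decision boundary."""
    score = sum(w_j * x_j for w_j, x_j in zip(w, x)) + b
    return 1 if score >= 0 else -1


# Toy, linearly separable data (made up for illustration).
points = [(2, 2), (3, 3), (2, 3), (-2, -2), (-3, -3), (-2, -3)]
labels = [1, 1, 1, -1, -1, -1]

w, b = train_linear_svm(points, labels)
print("weights:", w, "bias:", b)
print("predictions:", [predict(w, b, x) for x in points])
```

Only points whose margin is below 1 influence an update, which mirrors the role of support vectors: samples far from the boundary do not affect the fitted classifier.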

artificial intelligence, decision boundary, machine learning, (16 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Misclassification cost-sensitive ensemble learning: A unifying framework

Petrides, George, Verbeke, Wouter

arXiv.org Machine Learning, Jul-14-2020

The task of supervised machine learning is, given a set of recorded observations and their outcomes, to predict the outcome of new observations. Standard classification techniques aim for the highest overall accuracy or, equivalently, for the smallest total error, and include among others support vector machines, Bayesian classifiers, logistic regression, decision tree classifiers such as CART [6] and C4.5 [38], and ensemble methods which build several classifiers and aggregate their predictions, such as Bagging [4], AdaBoost [16] and Random Forests [5]. Of particular interest in certain domains are binary classifiers, which deal with cases where only two classes of outcomes are considered, such as fraudulent and legitimate credit card transactions, responders and non-responders to a marketing campaign, patients with and without cancer, intrusive and authorised network access, and defaulting and repaying debtors, to name a few. In most of these cases, one of the classes is a small minority, and consequently traditional classifiers might classify all of its members as belonging to the majority class without any significant loss of overall accuracy. The severity of this class imbalance becomes more noticeable when failing to correctly predict a minority class member is more costly than doing so with a member of the majority class, as is often the case. A remedy to the undesirable situation just described is to use classifiers which, instead of accuracy, take misclassification costs into account and are thus termed cost-sensitive. We illustrate this idea in the credit card fraud detection framework: accepting a fraudulent transaction as legitimate incurs a cost equal to its amount.
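
The credit card example can be turned into a small decision rule that compares expected costs instead of applying a fixed probability threshold. In the sketch below, the cost of accepting a fraudulent transaction is its amount, as stated above; the fixed `review_cost` charged for blocking a legitimate transaction, the function names and the numbers are assumptions introduced only for this illustration.

```python
def expected_costs(p_fraud, amount, review_cost):
    """Expected cost of each action given the estimated fraud probability.

    Accepting a fraudulent transaction costs its amount; blocking a
    legitimate one costs a fixed review fee (an assumed value)."""
    cost_if_accept = p_fraud * amount
    cost_if_block = (1.0 - p_fraud) * review_cost
    return cost_if_accept, cost_if_block


def decide(p_fraud, amount, review_cost=5.0):
    """Choose the action with the lower expected cost."""
    cost_if_accept, cost_if_block = expected_costs(p_fraud, amount, review_cost)
    return "block" if cost_if_block < cost_if_accept else "accept"


# Same estimated fraud probability, different transaction amounts.
for amount in (100.0, 1000.0):
    print(amount, decide(p_fraud=0.02, amount=amount))
```

With a 2% fraud probability, an accuracy-oriented rule that flags a transaction only when fraud is more likely than not would accept both transactions, whereas the cost-based rule blocks the larger one because its expected loss exceeds the assumed review cost.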

artificial intelligence, classifier, machine learning, (18 more...)

arXiv.org Machine Learning

2007.07361

Country:

Europe > Austria > Vienna (0.14)
North America > United States > California (0.04)
Europe > Norway > Western Norway > Vestland > Bergen (0.04)
Europe > Belgium (0.04)

Genre:

Research Report > New Finding (0.34)
Research Report > Experimental Study (0.34)

Industry:

Law Enforcement & Public Safety > Fraud (1.00)
Banking & Finance > Credit (0.89)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)
