AITopics | Support Vector Machines

Collaborating Authors

Support Vector Machines

Support vector machines (SVMs, also support vector networks[1]) are supervised learning models with associated learning algorithms that analyze data used for classification and regression analysis. (Wikipedia)

News Overviews Instructional Materials AI-Alerts Classics

Fast Sampling for Bayesian Max-Margin Models

Hu, Wenbo, Zhu, Jun, Zhang, Bo

arXiv.org Artificial IntelligenceOct-18-2016

Bayesian max-margin models have shown superiority in various practical applications, such as text categorization, collaborative prediction, social network link prediction and crowdsourcing, and they conjoin the flexibility of Bayesian modeling and predictive strengths of max-margin learning. However, Monte Carlo sampling for these models still remains challenging, especially for applications that involve large-scale datasets. In this paper, we present the stochastic subgradient Hamiltonian Monte Carlo (HMC) methods, which are easy to implement and computationally efficient. We show the approximate detailed balance property of subgradient HMC which reveals a natural and validated generalization of the ordinary HMC. Furthermore, we investigate the variants that use stochastic subsampling and thermostats for better scalability and mixing. Using stochastic subgradient Markov Chain Monte Carlo (MCMC), we efficiently solve the posterior inference task of various Bayesian max-margin models and extensive experimental results demonstrate the effectiveness of our approach.

artificial intelligence, classifier, machine learning, (13 more...)

arXiv.org Artificial Intelligence

1504.07107

Genre: Research Report > New Finding (0.88)

Industry:

Information Technology (0.48)
Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.89)
(2 more...)

Add feedback

Semi-Supervised Active Learning for Support Vector Machines: A Novel Approach that Exploits Structure Information in Data

Reitmaier, Tobias, Calma, Adrian, Sick, Bernhard

arXiv.org Machine LearningOct-14-2016

In our today's information society more and more data emerges, e.g.~in social networks, technical applications, or business applications. Companies try to commercialize these data using data mining or machine learning methods. For this purpose, the data are categorized or classified, but often at high (monetary or temporal) costs. An effective approach to reduce these costs is to apply any kind of active learning (AL) methods, as AL controls the training process of a classifier by specific querying individual data points (samples), which are then labeled (e.g., provided with class memberships) by a domain expert. However, an analysis of current AL research shows that AL still has some shortcomings. In particular, the structure information given by the spatial pattern of the (un)labeled data in the input space of a classification model (e.g.,~cluster information), is used in an insufficient way. In addition, many existing AL techniques pay too little attention to their practical applicability. To meet these challenges, this article presents several techniques that together build a new approach for combining AL and semi-supervised learning (SSL) for support vector machines (SVM) in classification tasks. Structure information is captured by means of probabilistic models that are iteratively improved at runtime when label information becomes available. The probabilistic models are considered in a selection strategy based on distance, density, diversity, and distribution (4DS strategy) information for AL and in a kernel function (Responsibility Weighted Mahalanobis kernel) for SVM. The approach fuses generative and discriminative modeling techniques. With 20 benchmark data sets and with the MNIST data set it is shown that our new solution yields significantly better results than state-of-the-art methods.

artificial intelligence, kernel, machine learning, (17 more...)

arXiv.org Machine Learning

1610.03995

Country:

North America > United States (1.00)
Europe (1.00)

Genre: Research Report > Promising Solution (0.34)

Industry: Information Technology (0.86)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Add feedback

Exploring the Entire Regularization Path for the Asymmetric Cost Linear Support Vector Machine

Wesierski, Daniel

arXiv.org Machine LearningOct-12-2016

We propose an algorithm for exploring the entire regularization path of asymmetric-cost linear support vector machines. Empirical evidence suggests the predictive power of support vector machines depends on the regularization parameters of the training algorithms. The algorithms exploring the entire regularization paths have been proposed for single-cost support vector machines thereby providing the complete knowledge on the behavior of the trained model over the hyperparameter space. Considering the problem in two-dimensional hyperparameter space though enables our algorithm to maintain greater flexibility in dealing with special cases and sheds light on problems encountered by algorithms building the paths in one-dimensional spaces. We demonstrate two-dimensional regularization paths for linear support vector machines that we train on synthetic and real data.

algorithm, artificial intelligence, machine learning, (13 more...)

arXiv.org Machine Learning

1610.03738

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Add feedback

Recursion-Free Online Multiple Incremental/Decremental Analysis Based on Ridge Support Vector Learning

Chen, Bo-Wei

arXiv.org Machine LearningOct-11-2016

Th is study presents a rapid multiple incremental and decremental mechanism ba sed on Weight - Error Curves (WECs) fo r support - vector a nalysi s . To ha ndle rapidly increas ing amounts of data, recursion - free computation is proposed for predicting the Lagrangian multipliers of new samples . This study examines the characteristics of Ridge S upport V ector M odels, including Ridge S upport V ector Machines and Regression, subsequently devis ing a recursion - free function derived from WECs . With this proposed function, a ll of the new Lagrang ian multipliers can be computed at once without using any gradual step sizes. Moreover, such a function can relax a constraint, where the increment of new multiple Lagrang ian multipliers should be the same in the previous work, thereby easily satisfying the requirement of Karush - Kuhn - Tucker (KKT) conditions . The proposed mechanism no longer requires t ypical time - consuming bookkeeping strategies, which compute the step size by checking all the training samples in each incremental round. Experiments were carried out on open datasets for evaluating our work. The results showed that the computation al speed was successfully enhanced, better than the baselines. Besides, the accuracy still remained. These findings revealed that the proposed method was appropriate for incremental/decremental learning, thereby demonstrating the effectiveness of the propose d idea.

artificial intelligence, lagrangian multiplier, machine learning, (13 more...)

arXiv.org Machine Learning

1608.00619

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)
North America > United States > California (0.28)

Genre: Research Report > New Finding (0.86)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.91)

Add feedback

There's an app for that! Using your smartphone to test for Anemia. » Behind the Headlines

#artificialintelligenceOct-6-2016, 12:51:09 GMT

I'd be willing to bet that if you were asked to list ten uses for your smartphone, you probably wouldn't include "medical device" in your answer. But as smartphones become increasingly capable, highly-portable computing platforms, researchers are looking to the computer in everyone's pocket as a way to improve global health. As Wired UK declared earlier this year, the next revolutionary medical device is likely to be your smartphone. Scientists have already developed smartphone-based apps that can monitor asthma, detect skin cancer, and diagnose traumatic brain injuries. The latest app that joins the "doctor in your pocket" list is helping screen for anemia.

artificial intelligence, machine learning, smartphone, (15 more...)

#artificialintelligence

Country:

North America > United States (0.34)
Europe > Germany (0.05)

Genre: Research Report (0.34)

Industry:

Health & Medicine > Therapeutic Area > Hematology (1.00)
Government > Regional Government > North America Government > United States Government > FDA (0.34)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.30)

Add feedback

The rapid evolution of open-source machine learning – Seldon -- Open Source Machine Learning

#artificialintelligenceOct-4-2016, 07:35:43 GMT

When millions of people across the world tuned in to watch DeepMind's machine beat the human Go world champion Lee Sedol, they also witnessed a historic victory for open-source. DeepMind used a scientific computing framework called Torch extensively in the development and execution of AlphaGo's neural networks. Torch was first released back in 2002 under a BSD open-source license with algorithms that are still commonly used by data scientists such as multi-layer perceptrons, support vector machines and K-nearest neighbours. Torch also supported ensembles -- a popular technique that combines the output of multiple algorithms, usually with a weighted average. It's not just open-source software that contributed to the growth of machine learning.

artificial intelligence, machine learning, seldon, (14 more...)

#artificialintelligence

Country: North America > United States > California (0.05)

Industry:

Information Technology (1.00)
Leisure & Entertainment > Games > Go (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.55)

Add feedback

Stealing Machine Learning Models via Prediction APIs

Tramèr, Florian, Zhang, Fan, Juels, Ari, Reiter, Michael K., Ristenpart, Thomas

arXiv.org Machine LearningOct-2-2016

Machine learning (ML) models may be deemed confidential due to their sensitive training data, commercial value, or use in security applications. Increasingly often, confidential ML models are being deployed with publicly accessible query interfaces. ML-as-a-service ("predictive analytics") systems are an example: Some allow users to train models on potentially sensitive data and charge others for access on a pay-per-query basis. The tension between model confidentiality and public access motivates our investigation of model extraction attacks. In such attacks, an adversary with black-box access, but no prior knowledge of an ML model's parameters or training data, aims to duplicate the functionality of (i.e., "steal") the model. Unlike in classical learning theory settings, ML-as-a-service offerings may accept partial feature vectors as inputs and include confidence values with predictions. Given these practices, we show simple, efficient attacks that extract target ML models with near-perfect fidelity for popular model classes including logistic regression, neural networks, and decision trees. We demonstrate these attacks against the online services of BigML and Amazon Machine Learning. We further show that the natural countermeasure of omitting confidence values from model outputs still admits potentially harmful model extraction attacks. Our results highlight the need for careful ML model deployment and new model extraction countermeasures.

artificial intelligence, machine learning, query, (17 more...)

arXiv.org Machine Learning

1609.02943

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Banking & Finance (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Minimum Density Hyperplanes

Pavlidis, Nicos G., Hofmeyr, David P., Tasoulis, Sotiris K.

arXiv.org Machine LearningSep-28-2016

Associating distinct groups of objects (clusters) with contiguous regions of high probability density (high-density clusters), is central to many statistical and machine learning approaches to the classification of unlabelled data. We propose a novel hyperplane classifier for clustering and semi-supervised classification which is motivated by this objective. The proposed minimum density hyperplane minimises the integral of the empirical probability density function along it, thereby avoiding intersection with high density clusters. We show that the minimum density and the maximum margin hyperplanes are asymptotically equivalent, thus linking this approach to maximum margin clustering and semi-supervised support vector classifiers. We propose a projection pursuit formulation of the associated optimisation problem which allows us to find minimum density hyperplanes efficiently in practice, and evaluate its performance on a range of benchmark data sets. The proposed approach is found to be very competitive with state of the art methods for clustering and semi-supervised classification.

artificial intelligence, hyperplane, machine learning, (16 more...)

arXiv.org Machine Learning

1507.04201

Country: North America > United States (0.28)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.66)

Add feedback

Three Things About Data Science You Won't Find In the Books

#artificialintelligenceSep-26-2016, 20:55:25 GMT

In case you haven't heard yet, Data Science is all the craze. Courses, posts, and schools are springing up everywhere. However, every time I take a look at one of those offerings, I see that a lot of emphasis is put on specific learning algorithms. Of course, understanding how logistic regression or deep learning works is cool, but once you start working with data, you find out that there are other things equally important, or maybe even more. I can't really blame these courses.

artificial intelligence, future data, machine learning, (16 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.30)

Add feedback

A Hybrid Machine Learning Method for Fusing fMRI and Genetic Data: Combining both Improves Classification of Schizophrenia

#artificialintelligenceSep-26-2016, 03:05:23 GMT

We demonstrate a hybrid machine learning method to classify schizophrenia patients and healthy controls, using functional magnetic resonance imaging (fMRI) and single nucleotide polymorphism (SNP) data. The method consists of four stages: (1) SNPs with the most discriminating information between the healthy controls and schizophrenia patients are selected to construct a support vector machine ensemble (SNP-SVME). The method was evaluated by a fully validated leave-one-out method using 40 subjects (20 patients and 20 controls). The classification accuracy was: 0.74 for SNP-SVME, 0.82 for Voxel-SVME, 0.83 for ICA-SVMC, and 0.87 for Combined SNP-fMRI. Experimental results show that better classification accuracy was achieved by combining genetic and fMRI data than using either alone, indicating that genetic and brain function representing different, but partially complementary aspects, of schizophrenia etiopathology.

artificial intelligence, fusing fmri and genetic data, hybrid machine learning method, (9 more...)

#artificialintelligence

Genre: Research Report (0.42)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
Health & Medicine > Health Care Technology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.81)

Add feedback