AITopics | Support Vector Machines


Support Vector Machines

Support vector machines (SVMs, also called support vector networks) are supervised learning models with associated learning algorithms that analyze data for classification and regression analysis. (Wikipedia)


Enumeration of Distinct Support Vectors for Interactive Decision Making

Kanamori, Kentaro, Hara, Satoshi, Ishihata, Masakazu, Arimura, Hiroki

arXiv.org Machine Learning Jun-5-2019

In conventional prediction tasks, a machine learning algorithm outputs a single best model that globally optimizes its objective function, which is typically accuracy. Users therefore cannot access other models explicitly. In contrast, multiple model enumeration is attracting increasing interest in non-standard machine learning applications where criteria other than accuracy, e.g., interpretability or fairness, are the main concern and a user may want to access more than one non-optimal but suitable model. In this paper, we propose a K-best model enumeration algorithm for Support Vector Machines (SVMs) that, given a dataset S and an integer K>0, enumerates the K best models on S with distinct support vectors in descending order of the objective function values in the dual SVM problem. Based on an analysis of the lattice structure of support vectors, our algorithm efficiently finds the next best model with small latency. This is useful in supporting users' interactive examination of their requirements on enumerated models. Through experiments on real datasets, we evaluated the efficiency and usefulness of our algorithm.
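For orientation, the dual SVM problem mentioned in this abstract is commonly written in the following textbook soft-margin form. This is a sketch, not necessarily the exact formulation used in the paper (which may, for example, drop the bias term and with it the equality constraint). The symbols $\alpha_i$, $C$, $K$ and $m$ are assumptions introduced here for illustration.

```latex
\max_{\alpha \in \mathbb{R}^m} \;
  \sum_{i=1}^{m} \alpha_i
  - \frac{1}{2} \sum_{i=1}^{m} \sum_{j=1}^{m}
      \alpha_i \alpha_j \, y_i y_j \, K(x_i, x_j)
\quad \text{subject to} \quad
  0 \le \alpha_i \le C \;\; (i = 1, \dots, m),
\qquad
  \sum_{i=1}^{m} \alpha_i y_i = 0 .
```

Under this form, the support vectors of a solution are the training points $x_i$ with $\alpha_i > 0$, so two models have "distinct support vectors" when these index sets differ.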

artificial intelligence, machine learning, supp, (14 more...)

arXiv.org Machine Learning

1906.01876

Country:

North America > United States (0.47)
Asia > Japan (0.28)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)


Confidence Regions in Wasserstein Distributionally Robust Estimation

Blanchet, Jose, Murthy, Karthyek, Si, Nian

arXiv.org Machine Learning Jun-4-2019

Wasserstein distributionally robust optimization (DRO) estimators are obtained as solutions of min-max problems in which the statistician selects a parameter minimizing the worst-case loss among all probability models within a certain distance (in a Wasserstein sense) from the underlying empirical measure. While motivated by the need to identify model parameters or decision choices that are robust to model uncertainties and misspecification, Wasserstein DRO estimators recover a wide range of regularized estimators, including square-root LASSO and support vector machines, among others, as particular cases. This paper studies the asymptotic normality of the underlying DRO estimators as well as the properties of an optimal (in a suitable sense) confidence region induced by the Wasserstein DRO formulation.
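The min-max problem described in this abstract can be sketched as follows. The notation is an assumption made here for illustration and is not taken from the paper: $P_n$ is the empirical measure of the $n$ samples, $W$ a Wasserstein distance, $\delta > 0$ the radius of the uncertainty set, and $\ell(X; \theta)$ the loss at parameter $\theta$.

```latex
\hat{\theta}_n \in \arg\min_{\theta} \;
  \sup_{P \,:\, W(P, P_n) \le \delta} \;
  \mathbb{E}_{P}\!\left[ \ell(X; \theta) \right]
```

The inner supremum is the worst-case expected loss over all probability models within distance $\delta$ of $P_n$; the outer minimization is the statistician's choice of parameter.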

artificial intelligence, confidence region, machine learning, (15 more...)

arXiv.org Machine Learning

1906.01614

Country: North America > United States > California (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)


On the Correctness and Sample Complexity of Inverse Reinforcement Learning

Komanduru, Abi, Honorio, Jean

arXiv.org Machine Learning Jun-2-2019

Inverse reinforcement learning (IRL) is the problem of finding a reward function that generates a given optimal policy for a given Markov Decision Process. This paper presents an algorithm-independent geometric analysis of the IRL problem with finite states and actions. An L1-regularized Support Vector Machine formulation of the IRL problem, motivated by the geometric analysis, is then proposed with the basic objective of the inverse reinforcement problem in mind: to find a reward function that generates a specified optimal policy. The paper further analyzes the proposed formulation of inverse reinforcement learning with $n$ states and $k$ actions, and shows a sample complexity of $O(n^2 \log (nk))$ for recovering a reward function that generates a policy that satisfies Bellman's optimality condition with respect to the true transition probabilities.
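To make "a policy that satisfies Bellman's optimality condition" concrete, here is a minimal sketch of how such a check might look for a small finite MDP. It is not the paper's method. It assumes a state-based reward `R[s]`, a discount factor `gamma`, and transition probabilities stored as `P[a][s][t]`; all function and variable names are hypothetical.

```python
def policy_value(P, R, policy, gamma, sweeps=2000):
    """Iterative policy evaluation for a deterministic policy.

    P[a][s][t] is the probability of moving from state s to state t
    under action a; R[s] is the (assumed state-based) reward.
    """
    n = len(R)
    V = [0.0] * n
    for _ in range(sweeps):
        V = [
            R[s] + gamma * sum(P[policy[s]][s][t] * V[t] for t in range(n))
            for s in range(n)
        ]
    return V


def satisfies_bellman_optimality(P, R, policy, gamma=0.9, tol=1e-9):
    """Return True if no single-state action change improves on the policy."""
    n, k = len(R), len(P)
    V = policy_value(P, R, policy, gamma)
    for s in range(n):
        # Expected next-state value under the policy's action in state s.
        q_pi = sum(P[policy[s]][s][t] * V[t] for t in range(n))
        for a in range(k):
            q_a = sum(P[a][s][t] * V[t] for t in range(n))
            if q_a > q_pi + tol:
                return False
    return True


# Toy MDP (illustrative only): action 0 stays, action 1 switches state.
P_toy = [
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: switch
]
R_toy = [0.0, 1.0]  # only state 1 is rewarding
```

With this toy reward, the policy `[1, 0]` (switch out of state 0, stay in state 1) passes the check, while `[0, 0]` (always stay) fails it in state 0.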

artificial intelligence, machine learning, reinforcement learning, (14 more...)

arXiv.org Machine Learning

1906.00422

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.57)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)


On Coresets for Regularized Loss Minimization

Curtin, Ryan R., Im, Sungjin, Moseley, Ben, Pruhs, Kirk, Samadian, Alireza

arXiv.org Machine Learning May-31-2019

We design and mathematically analyze sampling-based algorithms for regularized loss minimization problems that are implementable in popular computational models for large data, in which access to the data is restricted in some way. Our main result is that if the regularizer's effect does not become negligible as the norm of the hypothesis scales, and as the data scales, then a uniform sample of modest size is with high probability a coreset. In the case that the loss function is either logistic regression or soft-margin support vector machines, and the regularizer is one of the common recommended choices, this result implies that a uniform sample of size $O(d \sqrt{n})$ is with high probability a coreset of $n$ points in $\Re^d$. We contrast this upper bound with two lower bounds. The first lower bound shows that our analysis of uniform sampling is tight; that is, a smaller uniform sample will likely not be a coreset. The second lower bound shows that in some sense uniform sampling is close to optimal, as significantly smaller coresets do not generally exist.
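The idea of using a uniform sample of size on the order of $d \sqrt{n}$ as a stand-in for the full dataset can be sketched as below. This is an illustration under stated assumptions, not the paper's algorithm or analysis: the constant `c`, the regularization weight `lam`, the squared-norm regularizer, and the reweighting of sampled points by `n / m` are all choices made here, and every name is hypothetical.

```python
import math
import random


def hinge_objective(points, labels, w, lam, weight=1.0):
    """Weighted soft-margin (hinge) loss plus a squared-norm regularizer."""
    loss = sum(
        max(0.0, 1.0 - y * sum(wi * xi for wi, xi in zip(w, x)))
        for x, y in zip(points, labels)
    )
    return weight * loss + lam * sum(wi * wi for wi in w)


def uniform_coreset(points, labels, c=1.0, seed=0):
    """Draw a uniform sample of size about c * d * sqrt(n) without replacement.

    Returns the sampled points, their labels, and the per-point weight
    n / m that rescales the sample's loss to the full dataset's scale.
    """
    n, d = len(points), len(points[0])
    m = min(n, math.ceil(c * d * math.sqrt(n)))
    rng = random.Random(seed)
    idx = rng.sample(range(n), m)
    return [points[i] for i in idx], [labels[i] for i in idx], n / m
```

For a candidate hypothesis `w`, one would compare `hinge_objective(points, labels, w, lam)` on the full data against `hinge_objective(sample, sample_labels, w, lam, weight)` on the sample; the abstract's claim concerns when these two values stay close for all `w` with high probability.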

artificial intelligence, coreset, machine learning, (17 more...)

arXiv.org Machine Learning

1905.10845

Genre:

Research Report > New Finding (0.37)
Research Report > Experimental Study (0.37)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.68)


Neural-Symbolic Argumentation Mining: an Argument in Favour of Deep Learning and Reasoning

Galassi, Andrea, Kersting, Kristian, Lippi, Marco, Shao, Xiaoting, Torroni, Paolo

arXiv.org Artificial Intelligence May-31-2019

... from a given document (Lippi 2016). Recent years have seen the development of a large number of techniques in this area, on the wake of the advancements produced by deep learning on the whole research field of natural language processing (NLP). Yet, it is widely recognized that the existing AM systems still have a large margin of improvement, as good results have been obtained with some genres where prior knowledge on the structure of the text eases some AM tasks, but other genres such as legal cases and social media documents still require more work (Cabrio and Villata, 2018). Performing and understanding argumentation requires advanced reasoning capabilities that are natural skills for humans, but which are difficult to learn for a machine. Understanding whether a given piece of evidence supports a given claim, or whether two claims attack each other, are complex problems that humans are able to address thanks to their ability to exploit commonsense knowledge, and to ...

On the other hand, AM has rapidly evolved by exploiting state-of-the-art neural architectures coming from deep learning. So far, these two worlds have progressed largely independently of each other. Only recently, a few works have taken some steps towards the integration of such methods, by applying techniques combining sub-symbolic classifiers with knowledge expressed in the form of rules and constraints to AM. For instance, Niculae et al. (2017) adopted structured support vector machines and recurrent neural networks to collectively classify argument components and their relations in short documents, by hard-coding contextual dependencies and constraints of the argument model in a factor graph. A joint inference approach for argument component classification and relation identification was instead proposed by Persing and Ng (2016), following a pipeline scheme where integer linear programming is used to enforce mathematical constraints on the outcomes of a first-stage set of classifiers.

artificial intelligence, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

1905.09103

Country:

North America (0.46)
Europe > Italy (0.14)
Europe > Germany (0.14)

Genre: Overview (0.94)

Industry: Law (0.54)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)


Meniere's Disease Prognosis by Learning from Transient-Evoked Otoacoustic Emission Signals

Kao, Sheng-Lun, Lien, Han-Wen, Liu, Tzu-Chi, Wu, Hau-Tieng, Fang, Te-Yung, Wang, Pa-Chun, Liu, Yi-Wen

arXiv.org Machine Learning May-30-2019

Accurate prognosis of Meniere's disease (MD) is difficult. The aim of this study is to treat it as a machine-learning problem through the analysis of transient-evoked (TE) otoacoustic emission (OAE) data obtained from MD patients. Thirty-three patients who received treatment were recruited, and their distortion-product (DP) OAE, TEOAE, and pure-tone audiograms were taken longitudinally up to 6 months after being diagnosed with MD. In hindsight, the patients were separated into two groups: those whose outer hair cell (OHC) functions eventually recovered, and those whose functions did not. TEOAE signals between 2.5 and 20 ms were dimension-reduced via principal component analysis, and binary classification was performed via the support vector machine. Through cross-validation, we demonstrate that the accuracy of prognosis can reach >80% based on data obtained at the first visit. Further analysis also shows that the TEOAE group delay at 1 kHz and 2 kHz tends to be longer for the group of ears that eventually recovered their OHC functions. The group delay can further be compared between the MD-affected ear and the opposite ear. The present results suggest that TEOAE signals provide abundant information for the prognosis of MD and that the information could be extracted by applying machine-learning techniques.

artificial intelligence, machine learning, prognosis, (18 more...)

arXiv.org Machine Learning

1905.13573

Country:

Asia > Taiwan > Taiwan Province > Taipei (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > North Carolina (0.04)
(2 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Otolaryngology (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.69)


High-low level support vector regression prediction approach (HL-SVR) for data modeling with input parameters of unequal sample sizes

Shi, Maolin, Sun, Wei, Song, Xueguan, Li, Hongyou

arXiv.org Machine Learning May-30-2019

Support vector regression (SVR) has been widely used to reduce the high computational cost of computer simulation. SVR assumes the input parameters have equal sample sizes, but unequal sample sizes are often encountered in engineering practice. To solve this issue, a new prediction approach based on SVR, namely the high-low-level SVR approach (HL-SVR), is proposed in this paper for data modeling with input parameters of unequal sample sizes. The proposed approach consists of low-level SVR models for the input parameters with larger sample sizes and a high-level SVR model for the input parameters with smaller sample sizes. For each training point of the input parameters with smaller sample sizes, one low-level SVR model is built based on its corresponding input parameters with larger sample sizes and their responses of interest. The high-level SVR model is built based on the responses obtained from the low-level SVR models and the input parameters with smaller sample sizes. Several numerical examples are used to validate the performance of HL-SVR. The experimental results indicate that HL-SVR can produce more accurate predictions than conventional SVR. The proposed approach is applied to the stress analysis of a dental implant, in which the structural parameters have massive samples but the implant material can only be selected from Ti and several of its alloys. The prediction performance of the proposed approach is much better than that of conventional SVR. The proposed approach can be used for the design, optimization and analysis of engineering systems with input parameters of unequal sample sizes.

artificial intelligence, input parameter, machine learning, (16 more...)

arXiv.org Machine Learning

1906.05777

Country:

Asia > China > Liaoning Province > Dalian (0.04)
Asia > China > Fujian Province > Xiamen (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)


Infusing domain knowledge in AI-based "black box" models for better explainability with application in bankruptcy prediction

Islam, Sheikh Rabiul, Eberle, William, Bundy, Sid, Ghafoor, Sheikh Khaled

arXiv.org Artificial Intelligence May-30-2019

Although "black box" models such as Artificial Neural Networks, Support Vector Machines, and Ensemble Approaches continue to show superior performance in many disciplines, their adoption in sensitive disciplines (e.g., finance, healthcare) is questionable due to the lack of interpretability and explainability of the model. In fact, future adoption of "black box" models is difficult because of the recent "right of explanation" rule by the European Union, under which a user can ask for an explanation behind an algorithmic decision, and the newly proposed bill by the US government, the "Algorithmic Accountability Act", which would require companies to assess their machine learning systems for bias and discrimination and take corrective measures. Top bankruptcy prediction models are AI-based and are in need of better explainability, that is, the extent to which the internal working mechanisms of an AI system can be explained in human terms. Although explainable artificial intelligence is an emerging field of research, infusing domain knowledge for better explainability might be a possible solution. In this work, we demonstrate a way to collect and infuse domain knowledge into a "black box" model for bankruptcy prediction. Our experiments show that infused domain knowledge makes the output from the black box model more interpretable and explainable.

artificial intelligence, frequent feature, machine learning, (19 more...)

arXiv.org Artificial Intelligence

1905.11474

Country: North America > United States > Tennessee (0.15)

Genre: Research Report (0.64)

Industry:

Transportation > Air (1.00)
Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance > Loans > Mortgages (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)


TMLab SRPOL at SemEval-2019 Task 8: Fact Checking in Community Question Answering Forums

Niewinski, Piotr, Wawer, Aleksander, Pszona, Maria, Janicka, Maria

arXiv.org Machine Learning May-29-2019

The article describes our submission to SemEval 2019 Task 8 on Fact-Checking in Community Forums. The systems under discussion participated in Subtask A: deciding whether a question asks for factual information, asks for opinion or advice, or is just socializing. Our primary submission was ranked second among all participants in the official evaluation phase. The article presents our primary solution: a Deeply Regularized Residual Neural Network (DRR NN) with Universal Sentence Encoder embeddings. This is followed by a description of two contrastive solutions based on ensemble methods.

classification, machine learning, natural language, (17 more...)

arXiv.org Machine Learning

1906.01515

Country:

Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.05)
Europe > Poland > Masovia Province > Warsaw (0.04)

Genre: Research Report (0.83)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.48)


A Music Classification Model based on Metric Learning and Feature Extraction from MP3 Audio Files

da Silva, Angelo C. Mendes, Nunes, Mauricio A., Neto, Raul Fonseca

arXiv.org Machine Learning May-29-2019

The development of models for learning music similarity and feature extraction from audio media files is an increasingly important task for the entertainment industry. This work proposes a novel music classification model based on metric learning and feature extraction from MP3 audio files. The metric learning process learns a set of parameterized distances, employing a structured prediction approach, from a set of MP3 audio files containing several music genres. The main objective of this work is to make it possible to learn a personalized metric for each customer. To extract the acoustic information we use Mel-Frequency Cepstral Coefficients (MFCC) and reduce dimensionality using Principal Component Analysis. We validate the model by performing a set of experiments and comparing the training and testing results with baseline algorithms, such as K-means and the Soft Margin Linear Support Vector Machine (SVM). Experiments show promising results and encourage the future development of an online version of the learning model.

application 00, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

1905.12804

Country:

Europe > Austria > Vienna (0.14)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
North America > United States > New York > New York County > New York City (0.04)
(5 more...)

Genre: Research Report > New Finding (0.93)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)
Education > Curriculum > Subject-Specific Education (0.61)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.87)
