AITopics

1911.09061

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.05)
North America > United States > New Jersey > Passaic County > Clifton (0.04)
Europe > Netherlands > South Holland > Dordrecht (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry: Government (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.86)

arXiv.org Machine Learning Nov-18-2019

Benchmarking time series classification -- Functional data vs machine learning approaches

Pfisterer, Florian, Beggel, Laura, Sun, Xudong, Scheipl, Fabian, Bischl, Bernd

Time series classification problems have drawn increasing attention in the machine learning and statistical community. Closely related is the field of functional data analysis (FDA): it refers to the range of problems that deal with the analysis of data that is continuously indexed over some domain. While often employing different methods, both fields strive to answer similar questions, a common example being classification or regression problems with functional covariates. We study methods from functional data analysis, such as functional generalized additive models, as well as functionality to concatenate (functional-) feature extraction or basis representations with traditional machine learning algorithms like support vector machines or classification trees. In order to assess the methods and implementations, we run a benchmark on a wide variety of representative (time series) data sets, with in-depth analysis of empirical results, and strive to provide a reference ranking for which method(s) to use for non-expert practitioners. Additionally, we provide a software framework in R for functional data analysis for supervised learning, including machine learning and more linear approaches from statistics. This allows convenient access, and in connection with the machine-learning toolbox mlr, those methods can now also be tuned and benchmarked.

algorithm, artificial intelligence, machine learning, (15 more...)

1911.07511

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Wisconsin > Portage County > Stevens Point (0.04)
Europe > Spain > Galicia > Madrid (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre:

Research Report (0.82)
Overview (0.68)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

arXiv.org Machine Learning Nov-17-2019

Sequence-Aware Factorization Machines for Temporal Predictive Analytics

Chen, Tong, Yin, Hongzhi, Nguyen, Quoc Viet Hung, Peng, Wen-Chih, Li, Xue, Zhou, Xiaofang

In various web applications like targeted advertising and recommender systems, the available categorical features (e.g., product type) are often of great importance but sparse. As a widely adopted solution, models based on Factorization Machines (FMs) are capable of modelling high-order interactions among features for effective sparse predictive analytics. As the volume of web-scale data grows exponentially over time, sparse predictive analytics inevitably involves dynamic and sequential features. However, existing FM-based models assume no temporal orders in the data, and are unable to capture the sequential dependencies or patterns within the dynamic features, impeding the performance and adaptivity of these methods. Hence, in this paper, we propose a novel Sequence-Aware Factorization Machine (SeqFM) for temporal predictive analytics, which models feature interactions by fully investigating the effect of sequential dependencies. As static features (e.g., user gender) and dynamic features (e.g., user interacted items) express different semantics, we innovatively devise a multi-view self-attention scheme that separately models the effect of static features, dynamic features and the mutual interactions between static and dynamic features in three different views. In SeqFM, we further map the learned representations of feature interactions to the desired output with a shared residual network. To showcase the versatility and generalizability of SeqFM, we test SeqFM in three popular application scenarios for FM-based models, namely ranking, classification and regression tasks. Extensive experimental results on six large-scale datasets demonstrate the superior effectiveness and efficiency of SeqFM. As an important supervised learning scheme, predictive analytics play a pivotal role in various applications, ranging from recommender systems [1], [2] to financial analysis [3] and online advertising [4], [5].
In practice, the goal of predictive analytics is to learn a mapping function from the observed variables (i.e., features) to the desired output. When dealing with categorical features in predictive analytics, a common approach is to convert such features into one-hot encodings [6]-[8] so that standard regressors like logistic regression [9] and support vector machines [10] can be directly applied. Due to the large number of possible category variables, the converted one-hot features are usually of high dimensionality but sparse [11], and simply using raw features rarely provides optimal results. The interactions among multiple raw features are usually termed as cross features [7] (a.k.a.
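The one-hot encoding and pairwise feature interactions mentioned in this excerpt can be illustrated with a small sketch of the second-order term of a plain factorization machine. This is not SeqFM and not the authors' code: the helper names, the feature layout (3 product types, 2 user segments) and the latent vectors in `V` are all made up for illustration.

```python
from itertools import combinations


def one_hot(index, size):
    """Return a one-hot list of length `size` with 1.0 at position `index`."""
    vec = [0.0] * size
    vec[index] = 1.0
    return vec


def fm_pairwise_naive(x, V):
    """Sum of <v_i, v_j> * x_i * x_j over all feature pairs i < j."""
    total = 0.0
    for i, j in combinations(range(len(x)), 2):
        dot = sum(a * b for a, b in zip(V[i], V[j]))
        total += dot * x[i] * x[j]
    return total


def fm_pairwise_fast(x, V):
    """Same quantity via 0.5 * sum_f ((sum_i v_if x_i)^2 - sum_i (v_if x_i)^2)."""
    total = 0.0
    for f in range(len(V[0])):
        s = sum(V[i][f] * x[i] for i in range(len(x)))
        s_sq = sum((V[i][f] * x[i]) ** 2 for i in range(len(x)))
        total += s * s - s_sq
    return 0.5 * total


# Hypothetical input: product type 1 of 3, user segment 0 of 2, concatenated.
x = one_hot(1, 3) + one_hot(0, 2)
# Hypothetical latent vectors, one row of k=2 factors per one-hot feature.
V = [[0.1, 0.2], [0.3, -0.1], [0.0, 0.5], [0.4, 0.4], [-0.2, 0.1]]

naive = fm_pairwise_naive(x, V)
fast = fm_pairwise_fast(x, V)
```

Because only two entries of `x` are non-zero, only one pair contributes to the sum, which is why sparse one-hot inputs stay cheap to score even when the feature space is large.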

feature interaction, interaction, seqfm, (15 more...)

1911.02752

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Oceania > Australia > Queensland (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report > New Finding (0.66)

Industry:

Marketing (0.34)
Information Technology > Services (0.34)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)

#artificialintelligence Nov-16-2019, 17:43:26 GMT

Facebook Stock Prediction Using Python & Machine Learning

In this article I will show you how to create your own stock prediction Python program using a machine learning algorithm called Support Vector Regression (SVR). The program will read in Facebook (FB) stock data and make a prediction of the open price based on the day. Support Vector Regression (SVR) is a type of Support Vector Machine, a supervised learning algorithm, that is used for regression analysis. In 1996, this version of SVM for regression was proposed by Christopher J. C. Burges, Vladimir N. Vapnik, Harris Drucker, Alexander J. Smola and Linda Kaufman. The model produced by SVR depends only on a subset of the training data, because the cost function for building the model ignores any training data close to the model prediction.
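The statement that the cost function ignores training data close to the model prediction can be sketched with the epsilon-insensitive loss commonly used in SVR. This is a minimal illustration, not the article's program: the prices, the predictions and the `epsilon` value are invented, and no model is actually fitted.

```python
def epsilon_insensitive_loss(y_true, y_pred, epsilon=0.1):
    """Per-sample loss max(0, |y - f(x)| - epsilon)."""
    return [max(0.0, abs(t - p) - epsilon) for t, p in zip(y_true, y_pred)]


# Hypothetical open prices and hypothetical predictions (not real FB data).
actual = [180.0, 181.5, 179.0, 183.0]
predicted = [180.05, 181.0, 179.08, 185.0]

losses = epsilon_insensitive_loss(actual, predicted, epsilon=0.1)

# Samples whose error exceeds epsilon are the only ones that add to the cost.
contributing = [i for i, loss in enumerate(losses) if loss > 0]
```

Here the first and third samples fall inside the epsilon tube and contribute nothing, so only the second and fourth would influence the fit.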

machine learning, prediction, support vector regression, (9 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

arXiv.org Machine Learning Nov-14-2019

Face shape classification using Inception v3

Tio, Adonis Emmanuel

In this paper, we present experimental results obtained from retraining the last layer of the Inception v3 model in classifying images of human faces into one of five basic face shapes. The accuracy of the retrained Inception v3 model was compared with that of the following classification methods that use facial landmark distance ratios and angles as features: linear discriminant analysis (LDA), support vector machines with linear kernel (SVM-LIN), support vector machines with radial basis function kernel (SVM-RBF), artificial neural networks or multilayer perceptron (MLP), and k-nearest neighbors (KNN). All classifiers were trained and tested using a total of 500 images of female celebrities with known face shapes collected from the Internet. Results show that training accuracy and overall accuracy range from 98.0% to 100% and from 84.4% to 84.8% for Inception v3 and from 50.6% to 73.0% and from 36.4% to 64.6% for the other classifiers depending on the training set size used. This result shows that the retrained Inception v3 model was able to fit the training data well and outperform the other classifiers without the need to handpick specific features to include in model training. Future work should consider expanding the labeled dataset, preferably one that can also be freely distributed to the research community, so that proper model cross-validation can be performed. As far as we know, this is the first in the literature to use convolutional neural networks in face-shape classification. The scripts are available at https://github.com/adonistio/inception-face-shape-classifier.

classification, classifier, face shape, (13 more...)

1911.07916

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > Canada > Ontario (0.04)
Asia > Thailand > Chiang Mai > Chiang Mai (0.04)
Asia > Philippines > Luzon > National Capital Region > City of Quezon (0.04)

Genre: Research Report (0.90)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.70)

#artificialintelligence Nov-13-2019, 14:44:16 GMT

Machine Learning in the Quantum Era

Machine Learning aims at automatically identifying structures and patterns in large data sets. In order to identify these patterns, algorithms often resort to standard linear algebra routines such as matrix inversion or eigenvalue decompositions. For example, support vector machines, one of the most successful traditional machine learning approaches for classification, can be cast as a linear system of equations, and then be solved using matrix inversion. Similarly, identifying the important signals in a data set can be done by identifying the leading eigenvalues and vectors of the data matrix, a method called principal component analysis. The large dimensionality of the vector spaces involved in these operations makes their implementation at large scale very resource intensive, thus motivating the development of innovative methods to lower their computational cost.
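As a classical point of reference for the principal component analysis step described above, the leading eigenvalue and eigenvector of a covariance matrix can be found by power iteration. The sketch below is not a quantum algorithm and is not taken from the article; the function names and the four 2-D data points are assumptions chosen for illustration.

```python
import math


def covariance_matrix(rows):
    """Sample covariance matrix of a list of equal-length rows."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    return [
        [sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(d)]
        for a in range(d)
    ]


def power_iteration(matrix, steps=200):
    """Approximate the leading eigenpair of a symmetric PSD matrix."""
    d = len(matrix)
    v = [1.0 / math.sqrt(d)] * d  # start vector; must not be orthogonal to the answer
    eigenvalue = 0.0
    for _ in range(steps):
        w = [sum(matrix[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break
        v = [x / norm for x in w]
        eigenvalue = norm
    return eigenvalue, v


# Hypothetical data lying close to the line y = 2x.
data = [[1.0, 2.0], [2.0, 4.1], [3.0, 5.9], [4.0, 8.0]]
C = covariance_matrix(data)
eigenvalue, component = power_iteration(C)
```

Each step costs one matrix-vector product, which grows with the square of the dimension; that cost is the kind of bottleneck the passage refers to.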

algorithm, machine learning, quantum state, (15 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.55)

Yan, Ziang, Guo, Yiwen, Zhang, Changshui

Adversarial Margin Maximization Networks

arXiv.org Machine Learning Nov-13-2019

The tremendous recent success of deep neural networks (DNNs) has sparked a surge of interest in understanding their predictive ability. Unlike the human visual system, which is able to generalize robustly and learn with little supervision, DNNs normally require a massive amount of data to learn new concepts. In addition, research also shows that DNNs are vulnerable to adversarial examples, i.e., maliciously generated images that seem perceptually similar to natural ones but are actually crafted to fool learning models, which means the models have problems generalizing to unseen data with certain types of distortions. In this paper, we analyze the generalization ability of DNNs comprehensively and attempt to improve it from a geometric point of view. We propose adversarial margin maximization (AMM), a learning-based regularization which exploits an adversarial perturbation as a proxy. It encourages a large margin in the input space, just like support vector machines. With a differentiable formulation of the perturbation, we train the regularized DNNs simply through back-propagation in an end-to-end manner. Experimental results on various datasets (including MNIST, CIFAR-10/100, SVHN and ImageNet) and different DNN architectures demonstrate the superiority of our method over previous state-of-the-art methods. Code and models for reproducing our results will be made publicly available.

architecture, experiment, generalization ability, (16 more...)

doi: 10.1109/TPAMI.2019.2948348

1911.05916

Country:

Asia > China > Beijing > Beijing (0.05)
Asia > China > Hubei Province > Wuhan (0.04)

Genre: Research Report > New Finding (0.66)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.69)

#artificialintelligence Nov-11-2019, 20:46:34 GMT

bims-arihec 2019-11-10 papers

OBJECTIVE: To evaluate the potential value of the machine learning (ML)-based MRI texture analysis for predicting 1p/19q codeletion status of lower-grade gliomas (LGG), using various state-of-the-art ML algorithms. MATERIALS AND METHODS: For this retrospective study, 107 patients with LGG were included from a public database. Texture features were extracted from conventional T2-weighted and contrast-enhanced T1-weighted MRI images, using LIFEx software. Training and unseen validation splits were created using stratified 10-fold cross-validation technique along with minority over-sampling. Dimension reduction was done using collinearity analysis and feature selection (ReliefF). Classifications were done using adaptive boosting, k-nearest neighbours, naive Bayes, neural network, random forest, stochastic gradient descent, and support vector machine.
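The stratified 10-fold splitting mentioned in the methods can be sketched as follows. This is a generic illustration, not the study's pipeline: the function name, the seed and the 70/37 class split are assumptions (the abstract gives only the total of 107 patients), and the over-sampling step is not shown.

```python
import random
from collections import defaultdict


def stratified_k_fold(labels, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs that roughly preserve class proportions."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    # Deal each class's shuffled indices round-robin across the k folds.
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)

    for i in range(k):
        test = sorted(folds[i])
        train = sorted(idx for j, fold in enumerate(folds) if j != i for idx in fold)
        yield train, test


# Hypothetical labels: 107 cases with an assumed 70/37 class split.
labels = [1] * 70 + [0] * 37
splits = list(stratified_k_fold(labels, k=10))
```

Every case appears in exactly one validation fold, and each fold holds 7 cases of the larger class plus 3 or 4 of the smaller one.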

algorithm, mri texture analysis, neural network, (10 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.64)

#artificialintelligence Nov-11-2019, 17:23:16 GMT

Day #84: Model training progress

Two-dimensional data is easier to visualize than 202-dimensional data. I don't normally leave my computer on 24 hours a day, but it has been hard at work, so it hasn't been off in days. Training a model can take some time, as I am finding out, and while I've made progress on the resulting model, it's still not where I want it. Let's talk about what has been going on. If you recall, my post a few days ago covered what I've been up to these past few days.

accuracy, hyperplane, model training progress, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)

Burdack, Johannes, Horst, Fabian, Giesselbach, Sven, Hassan, Ibrahim, Daffner, Sabrina, Schöllhorn, Wolfgang I.

Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Classification of Gait Using Machine Learning

arXiv.org Machine Learning Nov-11-2019

Human movements are characterized by highly non-linear and multi-dimensional interactions within the motor system. Recently, an increasing emphasis on machine-learning applications has led to a significant contribution to the field of gait analysis, e.g., by increasing classification accuracy. In order to ensure the generalizability of the machine-learning models, different data preprocessing steps are usually carried out to process the measured raw data before classification. In the past, various methods have been used for each of these preprocessing steps. However, there are hardly any standard procedures or systematic comparisons of these different methods and their impact on the classification accuracy. Therefore, the aim of this analysis is to compare different combinations of commonly applied data preprocessing steps and test their effects on the classification accuracy of gait patterns. A publicly available dataset on intra-individual changes of gait patterns was used for this analysis. Forty-two healthy subjects performed 6 sessions of 15 gait trials for one day. For each trial, two force plates recorded the 3D ground reaction forces (GRF). The data was preprocessed with the following steps: GRF filtering, time derivative, time normalization, data reduction, weight normalization and data scaling. Subsequently, combinations of all methods from each individual preprocessing step were analyzed and compared with respect to their prediction accuracy in a six-session classification using Support Vector Machines, Random Forest Classifiers and Multi-Layer Perceptrons. In conclusion, the present results provide first domain-specific recommendations for commonly applied data preprocessing methods and might help to build more comparable and more robust classification models based on machine learning that are suitable for a practical application.
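Three of the preprocessing steps listed in this abstract (time normalization, weight normalization and data scaling) could look roughly like the sketch below. The abstract does not specify which variants were compared, so every choice here is an assumption: linear resampling to 101 points, division by body weight in newtons, and min-max scaling, applied to invented force samples.

```python
def time_normalize(signal, n_points=101):
    """Linearly resample `signal` to `n_points` equally spaced samples."""
    if len(signal) < 2 or n_points < 2:
        raise ValueError("need at least two input samples and two output points")
    last = len(signal) - 1
    resampled = []
    for k in range(n_points):
        pos = k * last / (n_points - 1)  # fractional index into the input
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        resampled.append(signal[lo] * (1 - frac) + signal[hi] * frac)
    return resampled


def weight_normalize(signal, body_weight_newton):
    """Express forces as multiples of body weight."""
    return [value / body_weight_newton for value in signal]


def min_max_scale(signal):
    """Scale values to the range [0, 1]; a constant signal maps to zeros."""
    lo, hi = min(signal), max(signal)
    if hi == lo:
        return [0.0] * len(signal)
    return [(value - lo) / (hi - lo) for value in signal]


# Hypothetical vertical GRF samples (newtons) for one stance phase.
raw = [0.0, 400.0, 800.0, 700.0, 750.0, 300.0, 0.0]
processed = min_max_scale(weight_normalize(time_normalize(raw), 700.0))
```

After these steps every trial has the same length and range, so trials from different subjects and sessions can be stacked into one feature matrix for a classifier.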

accuracy, classification, prediction accuracy, (14 more...)

1911.04335

Country:

Europe > Germany > Rheinland-Pfalz > Mainz (0.04)
Asia > China > Shanghai > Shanghai (0.04)
Africa > Middle East > Egypt > Sharqia Governorate > Zagazig (0.04)
(5 more...)

Genre:

Workflow (0.96)
Research Report > New Finding (0.69)
Research Report > Experimental Study (0.47)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.74)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.54)