AITopics

Stamatescu, George, Gerace, Federica, Lucibello, Carlo, Fuss, Ian, White, Langford B.

Signal propagation in continuous approximations of binary neural networks

The training of stochastic neural network models with binary ($\pm1$) weights and activations via a deterministic and continuous surrogate network is investigated. We derive, using mean field theory, a set of scalar equations describing how input signals propagate through the surrogate network. The equations reveal that these continuous models exhibit an order-to-chaos transition, and the presence of depth scales that limit the maximum trainable depth. Moreover, we predict theoretically, and confirm numerically, that common weight initialization schemes used in standard continuous networks, when applied to the mean values of the stochastic binary weights, yield poor training performance. This study shows that, contrary to common intuition, the means of the stochastic binary weights should be initialised close to $\pm 1$ for deeper networks to be trainable.
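To make the notion of a stochastic binary weight concrete, here is a minimal Python sketch. It assumes each weight is an independent $\pm1$ variable with mean m, so that P(w = +1) = (1 + m)/2; the abstract does not state this parameterisation, and the function names are hypothetical. The sketch only illustrates that the variance 1 - m^2 of such a weight shrinks as its mean approaches $\pm 1$. It does not reproduce the paper's mean field equations.

```python
import random


def sample_binary_weight(mean, rng):
    """Draw w in {-1, +1} with E[w] = mean (assumed: P(w = +1) = (1 + mean) / 2)."""
    return 1 if rng.random() < (1.0 + mean) / 2.0 else -1


def weight_variance(mean):
    """Variance of a +/-1 variable: E[w^2] - E[w]^2 = 1 - mean^2."""
    return 1.0 - mean * mean


# Compare the empirical mean of sampled weights with the requested mean,
# and show how the variance falls as the mean moves towards +/-1.
rng = random.Random(0)
for mean in (0.0, 0.5, 0.95):
    draws = [sample_binary_weight(mean, rng) for _ in range(100_000)]
    empirical = sum(draws) / len(draws)
    print(f"mean={mean:.2f}  empirical={empirical:.3f}  variance={weight_variance(mean):.4f}")
```

With means near $\pm 1$ the sampled weights are almost deterministic, whereas a mean of 0 gives a fair coin flip between the two values.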

approximation, neural network, propagation

1902.00177

Country:

Europe > Italy > Piedmont > Turin Province > Turin (0.04)
Oceania > Australia > South Australia > Adelaide (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)

Genre: Research Report (0.41)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Ahmad, Amir, Khan, Shehroz S.

A Novel Initial Clusters Generation Method for K-means-based Clustering Algorithms for Mixed Datasets

Mixed datasets consist of numeric and categorical attributes. Various K-means-based clustering algorithms have been developed to cluster these datasets. Generally, these clustering algorithms use random initial clusters, which in turn produce different clustering results in different runs. A few cluster initialisation methods have been developed to compute initial clusters; however, they are either computationally expensive or they do not create the same clustering results in different runs. In this paper, we propose a novel approach to find initial clusters for K-means-based clustering algorithms for mixed datasets. The proposed approach is based on the observation that some data points in datasets remain in the same clusters created by a K-means-based clustering algorithm irrespective of the choice of initial clusters. It is proposed that individual attribute information can be used to create initial clusters. A K-means-based clustering algorithm is run many times; in each run, one of the attributes is used to create initial clusters. The clustering results of various runs are combined to produce a clustering result. This clustering result is used as initial clusters for a K-means-based clustering algorithm. Experiments with various categorical and mixed datasets showed that the proposed clustering approach produced accurate and consistent results.

algorithm, dataset, initial cluster method

doi: 10.13140/RG.2.2.21979.62244

1902.00127

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > Canada > Ontario > Toronto (0.04)
Europe > Ireland (0.04)

Genre: Research Report > New Finding (0.67)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Melhart, David, Azadvar, Ahmad, Canossa, Alessandro, Liapis, Antonios, Yannakakis, Georgios N.

Your Gameplay Says it All: Modelling Motivation in Tom Clancy's The Division

Is it possible to predict the motivation of players just by observing their gameplay data? Even if so, how should we measure motivation in the first place? To address the above questions, on the one hand, we collect a large dataset of gameplay data from players of the popular game Tom Clancy's The Division (Ubisoft, 2016). On the other hand, we ask them to report their levels of competence, autonomy, relatedness and presence using the in-house designed Ubisoft Perceived Experience Questionnaire. After processing the survey responses in an ordinal fashion, we employ preference learning methods, based on support vector machines, to infer the mapping between gameplay and the four motivation factors. Our key findings suggest that gameplay features are strong predictors of player motivation, as the obtained models reach accuracies of near certainty, in particular, from 93% up to 97% on unseen players.

motivation, motivation factor, proceedings

1902.0004

Country:

North America > United States > New York (0.04)
Oceania > New Zealand (0.04)
Oceania > Australia (0.04)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Information Technology (0.93)
Health & Medicine (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.87)

Kontar, Raed, Raskutti, Garvesh, Zhou, Shiyu

Minimizing Negative Transfer of Knowledge in Multivariate Gaussian Processes: A Scalable and Regularized Approach

Recently there has been an increasing interest in the multivariate Gaussian process (MGP), which extends the Gaussian process (GP) to deal with multiple outputs. One approach to construct the MGP and account for non-trivial commonalities amongst outputs employs a convolution process (CP). The CP is based on the idea of sharing latent functions across several convolutions. Despite the elegance of the CP construction, it poses new challenges that have yet to be tackled. First, even with a moderate number of outputs, model building is prohibitive due to the huge increase in computational demands and number of parameters to be estimated. Second, the negative transfer of knowledge may occur when some outputs do not share commonalities. In this paper we address these issues. We propose a regularized pairwise modeling approach for the MGP established using CP. The key feature of our approach is to distribute the estimation of the full multivariate model into a group of bivariate GPs which are individually built. Interestingly, pairwise modeling turns out to possess unique characteristics, which allow us to tackle the challenge of negative transfer through penalizing the latent function that facilitates information sharing in each bivariate model. Predictions are then made through combining predictions from the bivariate models within a Bayesian framework. The proposed method has excellent scalability when the number of outputs is large and minimizes the negative transfer of knowledge between uncorrelated outputs. Statistical guarantees for the proposed method are studied and its advantageous features are demonstrated through numerical studies.

covariance function, negative transfer, prediction

1901.11512

Country:

North America > United States > Wisconsin > Dane County > Madison (0.14)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > New York (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

Walder, Christian J., Nock, Richard, Ong, Cheng Soon, Sugiyama, Masashi

New Tricks for Estimating Gradients of Expectations

We derive a family of Monte Carlo estimators for gradients of expectations of univariate distributions, which is related to the log-derivative trick, but involves pairwise interactions between samples. The first of these comes from either a) introducing and approximating an integral representation based on the fundamental theorem of calculus, or b) applying the reparameterisation trick to an implicit parameterisation under infinitesimal perturbation of the parameters. From the former perspective we generalise to a reproducing kernel Hilbert space representation, giving rise to a locality parameter in the pairwise interactions mentioned above. The resulting estimators are unbiased and shown to offer an independent component of useful information in comparison with the log-derivative estimator. Promising analytical and numerical examples confirm the intuitions behind the new estimators.
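For readers unfamiliar with the two baseline techniques the abstract names, the following Python sketch compares the standard log-derivative (score-function) estimator with the standard reparameterisation estimator on a toy problem. The choices here are assumptions made for illustration: x ~ N(theta, 1), the test function f(x) = x^2, and all function names. The exact gradient of E[f(x)] = theta^2 + 1 is 2 * theta. This is not an implementation of the paper's new pairwise estimators.

```python
import random


def f(x):
    """Toy integrand (an assumption for this sketch): E[f(x)] = theta^2 + 1."""
    return x * x


def log_derivative_grad(theta, n, rng):
    """Score-function estimate of d/dtheta E[f(x)] for x ~ N(theta, 1).

    For unit variance, d/dtheta log p(x; theta) = x - theta.
    """
    total = 0.0
    for _ in range(n):
        x = rng.gauss(theta, 1.0)
        total += f(x) * (x - theta)
    return total / n


def reparam_grad(theta, n, rng):
    """Reparameterisation estimate: x = theta + z with z ~ N(0, 1), so df/dtheta = 2 * x."""
    total = 0.0
    for _ in range(n):
        x = theta + rng.gauss(0.0, 1.0)
        total += 2.0 * x
    return total / n


theta = 1.5
rng = random.Random(0)
print("exact gradient:        ", 2.0 * theta)
print("log-derivative estimate:", log_derivative_grad(theta, 200_000, rng))
print("reparameterised estimate:", reparam_grad(theta, 200_000, rng))
```

Both estimators are unbiased for this toy problem, but the log-derivative estimate typically fluctuates more between runs because its per-sample variance is larger.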

estimator, gradient, new trick

1901.11311

Country:

Asia > Middle East > Jordan (0.04)
Oceania > Australia (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Palma-Mendoza, Raul-Jose, de-Marcos, Luis, Rodriguez, Daniel, Alonso-Betanzos, Amparo

Distributed Correlation-Based Feature Selection in Spark

CFS (Correlation-Based Feature Selection) is an FS algorithm that has been successfully applied to classification problems in many domains. We describe Distributed CFS (DiCFS) as a completely redesigned, scalable, parallel and distributed version of the CFS algorithm, capable of dealing with the large volumes of data typical of big data applications. Two versions of the algorithm were implemented and compared using the Apache Spark cluster computing model, currently gaining popularity due to its much faster processing times than Hadoop's MapReduce model. We tested our algorithms on four publicly available datasets, each consisting of a large number of instances and two also consisting of a large number of features. The results show that our algorithms were superior in terms of both time-efficiency and scalability. In leveraging a computer cluster, they were able to handle larger datasets than the non-distributed WEKA version while maintaining the quality of the results, i.e., exactly the same features were returned by our algorithms when compared to the original algorithm available in WEKA.
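The abstract does not restate how CFS scores a candidate feature subset. As background, the sketch below computes the merit heuristic from Hall's original CFS formulation, k * r_cf / sqrt(k + k * (k - 1) * r_ff), where k is the subset size, r_cf the mean feature-class correlation and r_ff the mean feature-feature correlation. The function name and the correlation values are hypothetical and assumed to be precomputed. Nothing here reflects the distributed DiCFS design itself.

```python
import math


def cfs_merit(feature_class_corrs, mean_feature_feature_corr):
    """Merit of a feature subset under the CFS heuristic.

    feature_class_corrs: correlation of each selected feature with the class.
    mean_feature_feature_corr: average pairwise correlation among the features.
    """
    k = len(feature_class_corrs)
    if k == 0:
        return 0.0
    mean_fc = sum(feature_class_corrs) / k
    return (k * mean_fc) / math.sqrt(k + k * (k - 1) * mean_feature_feature_corr)


# Two equally predictive features: the merit is higher when they are not redundant.
print("uncorrelated features:", cfs_merit([0.5, 0.5], 0.0))
print("fully redundant features:", cfs_merit([0.5, 0.5], 1.0))
```

The heuristic rewards subsets whose features correlate with the class but not with each other, which is why the redundant pair scores no better than a single feature.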

algorithm, correlation, dataset

doi: 10.1016/j.ins.2018.10.052

1901.11286

Country:

Oceania > New Zealand > North Island > Waikato (0.04)
North America > United States > District of Columbia > Washington (0.04)
North America > Honduras > Francisco Morazán > Tegucigalpa (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)

Daily Mail - Science & tech

Jan-30-2019, 15:11:48 GMT

A five foot tall ROBOT tour guide called Betty will lead visitors around Blenheim Palace

Sir Winston Churchill's birthplace, Blenheim Palace, is experimenting with a five-foot-tall robot tour guide called Betty. The autonomous robot is the latest in a series of tech advances in the grand stately home. Betty is designed to seek out visitors to provide information and answer their questions. It even takes selfies with visitors and can upload them to social media using the Twitter hashtag #bettyinthepalace.

artificial intelligence, betty, blenheim palace

Country: Oceania > New Zealand > South Island > Marlborough District > Blenheim (0.92)

Industry: Consumer Products & Services > Travel (0.83)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

#artificialintelligence

Jan-30-2019, 07:32:17 GMT

The Davos crowd had high-minded talk about AI, stay tuned for the action | ZDNet

Working for a living will become obsolete. AI and robots will make the stuff we need and provide the services we need. The path to get there will be rocky. I don't worry about the end game. It's going to be great. But I worry about the path getting there.

action zdnet, artificial intelligence, high-minded talk

Country:

Oceania > Australia (0.19)
North America (0.17)

Technology: Information Technology > Artificial Intelligence > Robots (0.57)

arXiv.org Machine Learning

Jan-30-2019

An Evaluation of the Human-Interpretability of Explanation

Lage, Isaac, Chen, Emily, He, Jeffrey, Narayanan, Menaka, Kim, Been, Gershman, Sam, Doshi-Velez, Finale

Recent years have seen a boom in interest in machine learning systems that can provide a human-understandable rationale for their predictions or decisions. However, exactly what kinds of explanation are truly human-interpretable remains poorly understood. This work advances our understanding of what makes explanations interpretable under three specific tasks that users may perform with machine learning systems: simulation of the response, verification of a suggested response, and determining whether the correctness of a suggested response changes under a change to the inputs. Through carefully controlled human-subject experiments, we identify regularizers that can be used to optimize for the interpretability of machine learning systems. Our results show that the type of complexity matters: cognitive chunks (newly defined concepts) affect performance more than variable repetitions, and these trends are consistent across tasks and domains. This suggests that there may exist some common design principles for explanation systems.

experiment, explanation, response time

1902.00006

Country:

South America (0.14)
North America > Canada (0.04)
Oceania > Australia (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study > Negative Result (0.46)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.46)