AITopics

1712.05497

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.50)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.35)
Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.34)

Anaraki, Javad Rahimipour, Samet, Saeed, Eftekhari, Mahdi, Ahn, Chang Wook

A Fuzzy-Rough based Binary Shuffled Frog Leaping Algorithm for Feature Selection

arXiv.org Artificial IntelligenceJul-31-2018

Feature selection and attribute reduction are crucial problems, and widely used techniques in the field of machine learning, data mining and pattern recognition to overcome the well-known phenomenon of the Curse of Dimensionality, by either selecting a subset of features or removing unrelated ones. This paper presents a new feature selection method that efficiently carries out attribute reduction, thereby selecting the most informative features of a dataset. It consists of two components: 1) a measure for feature subset evaluation, and 2) a search strategy. For the evaluation measure, we have employed the fuzzy-rough dependency degree (FRFDD) in the lower approximation-based fuzzy-rough feature selection (L-FRFS) due to its effectiveness in feature selection. As for the search strategy, a new version of a binary shuffled frog leaping algorithm is proposed (B-SFLA). The new feature selection method is obtained by hybridizing the B-SFLA with the FRDD. Non-parametric statistical tests are conducted to compare the proposed approach with several existing methods over twenty two datasets, including nine high dimensional and large ones, from the UCI repository. The experimental results demonstrate that the B-SFLA approach significantly outperforms other metaheuristic methods in terms of the number of selected features and the classification accuracy.

artificial intelligence, evolutionary algorithm, machine learning, (16 more...)

1808.00068

Country:

North America > United States > Wisconsin (0.05)
North America > United States > New York (0.04)
North America > United States > New Jersey > Hudson County > Secaucus (0.04)
(5 more...)

Genre: Research Report > New Finding (0.34)

Industry: Health & Medicine > Therapeutic Area (0.96)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Henter, Gustav Eje, Leijon, Arne, Kleijn, W. Bastiaan

Kernel Density Estimation-Based Markov Models with Hidden State

arXiv.org Machine LearningJul-30-2018

We consider Markov models of stochastic processes where the next-step conditional distribution is defined by a kernel density estimator (KDE), similar to Markov forecast densities and certain time-series bootstrap schemes. The KDE Markov models (KDE-MMs) we discuss are nonlinear, nonparametric, fully probabilistic representations of stationary processes, based on techniques with strong asymptotic consistency properties. The models generate new data by concatenating points from the training data sequences in a context-sensitive manner, together with some additive driving noise. We present novel EM-type maximum-likelihood algorithms for data-driven bandwidth selection in KDE-MMs. Additionally, we augment the KDE-MMs with a hidden state, yielding a new model class, KDE-HMMs. The added state variable captures non-Markovian long memory and signal structure (e.g., slow oscillations), complementing the short-range dependences described by the Markov process. The resulting joint Markov and hidden-Markov structure is appealing for modelling complex real-world processes such as speech signals. We present guaranteed-ascent EM-update equations for model parameters in the case of Gaussian kernels, as well as relaxed update formulas that greatly accelerate training in practice. Experiments demonstrate increased held-out set probability for KDE-HMMs on several challenging natural and synthetic data series, compared to traditional techniques such as autoregressive models, HMMs, and their combinations.

artificial intelligence, kde-hmm, machine learning, (17 more...)

1807.1132

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Oceania > New Zealand > North Island > Wellington Region > Wellington (0.04)
(6 more...)

Genre: Research Report (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Henter, Gustav Eje, Wang, Xin, Yamagishi, Junichi

Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis

arXiv.org Machine LearningJul-30-2018

Generating versatile and appropriate synthetic speech requires control over the output expression separate from the spoken text. Important non-textual speech variation is seldom annotated, in which case output control must be learned in an unsupervised fashion. In this paper, we perform an in-depth study of methods for unsupervised learning of control in statistical speech synthesis. For example, we show that popular unsupervised training heuristics can be interpreted as variational inference in certain autoencoder models. We additionally connect these models to VQ-VAEs, another, recently-proposed class of deep variational autoencoders, which we show can be derived from a very similar mathematical argument. The implications of these new probabilistic interpretations are discussed. We illustrate the utility of the various approaches with an application to emotional speech synthesis, where the unsupervised methods for learning expression control (without access to emotional labels) are found to give results that in many aspects match or surpass the previous best supervised approach.

artificial intelligence, bayesian inference, machine learning, (18 more...)

1807.1147

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(5 more...)

Chua, Jeroen, Felzenszwalb, Pedro F.

Scene Grammars, Factor Graphs, and Belief Propagation

arXiv.org Artificial IntelligenceJul-30-2018

We describe a general framework for probabilistic modeling of complex scenes and inference from ambiguous observations. The approach is motivated by applications in image analysis and is based on the use of priors defined by stochastic grammars. We define a class of grammars that capture relationships between the objects in a scene and provide important contextual cues for statistical inference. The distribution over scenes defined by a probabilistic scene grammar can be represented by a graphical model and this construction can be used for efficient inference with loopy belief propagation. We show experimental results with two different applications. One application involves the reconstruction of binary contour maps. Another application involves detecting and localizing faces in images. In both applications the same framework leads to robust inference algorithms that can effectively combine local information to reason about a scene.

artificial intelligence, machine learning, natural language, (17 more...)

1606.01307

Country:

North America > United States > Rhode Island > Providence County > Providence (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
(4 more...)

Genre: Research Report (0.81)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Yin, Mingzhang, Zhou, Mingyuan

ARM: Augment-REINFORCE-Merge Gradient for Discrete Latent Variable Models

arXiv.org Machine LearningJul-29-2018

To backpropagate the gradients through discrete stochastic layers, we encode the true gradients into a multiplication between random noises and the difference of the same function of two different sets of discrete latent variables, which are correlated with these random noises. The expectations of that multiplication over iterations are zeros combined with spikes from time to time. To modulate the frequencies, amplitudes, and signs of the spikes to capture the temporal evolution of the true gradients, we propose the augment-REINFORCE-merge (ARM) estimator that combines data augmentation, the score-function estimator, permutation of the indices of latent variables, and variance reduction for Monte Carlo integration using common random numbers. The ARM estimator provides low-variance and unbiased gradient estimates for the parameters of discrete distributions, leading to state-of-the-art performance in both auto-encoding variational Bayes and maximum likelihood inference, for discrete latent variable models with one or multiple discrete stochastic layers.

artificial intelligence, bayesian inference, machine learning, (16 more...)

1807.11143

Country:

North America > United States > Texas > Travis County > Austin (0.14)
Asia > Middle East > Jordan (0.04)
North America > United States > New York (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.48)

arXiv.org Machine LearningJul-29-2018

A Margin-based MLE for Crowdsourced Partial Ranking

Xu, Qianqian, Xiong, Jiechao, Sun, Xinwei, Yang, Zhiyong, Cao, Xiaochun, Huang, Qingming, Yao, Yuan

A preference order or ranking aggregated from pairwise comparison data is commonly understood as a strict total order. However, in real-world scenarios, some items are intrinsically ambiguous in comparisons, which may very well be an inherent uncertainty of the data. In this case, the conventional total order ranking can not capture such uncertainty with mere global ranking or utility scores. In this paper, we are specifically interested in the recent surge in crowdsourcing applications to predict partial but more accurate (i.e., making less incorrect statements) orders rather than complete ones. To do so, we propose a novel framework to learn some probabilistic models of partial orders as a \emph{margin-based Maximum Likelihood Estimate} (MLE) method. We prove that the induced MLE is a joint convex optimization problem with respect to all the parameters, including the global ranking scores and margin parameter. Moreover, three kinds of generalized linear models are studied, including the basic uniform model, Bradley-Terry model, and Thurstone-Mosteller model, equipped with some theoretical analysis on FDR and Power control for the proposed methods. The validity of these models are supported by experiments with both simulated and real-world datasets, which shows that the proposed models exhibit improvements compared with traditional state-of-the-art algorithms.

artificial intelligence, machine learning, social media, (18 more...)

doi: 10.1145/3240508.3240597

1807.11014

Country:

Asia > China > Beijing > Beijing (0.05)
Asia > China > Hong Kong (0.04)
North America > United States > New York > New York County > New York City (0.04)
(9 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Nguyen, L. H., Pham, N. T. H., Ngo, V. M.

Opinion Spam Recognition Method for Online Reviews using Ontological Features

arXiv.org Artificial IntelligenceJul-29-2018

Reviews of a product are defined as the individual assessment of the product or service 1. Reviews must contain information about quality, or characteristics of the product. The reviews have become a good resource for decision making. In recent years, along with web spam 19, 22, email spam 23, 10 and blog spam 20, 18, review spam detection has attracted attention from research community 11, 14. Reviews on products are very important for both sellers and buyers in purchasing online. Customers who use the service from e-commerce websites will reference information from other customers through these reviews and make the best decision when they intend to buy a product.

machine learning, natural language, spam filtering, (18 more...)

1807.11024

Country: North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Services > e-Commerce Services (0.55)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Communications > Web (1.00)
Information Technology > Security & Privacy > Spam Filtering (0.90)
(4 more...)

Broelemann, Klaus, Kasneci, Gjergji

Combining Restricted Boltzmann Machines with Neural Networks for Latent Truth Discovery

arXiv.org Artificial IntelligenceJul-27-2018

Latent truth discovery, LTD for short, refers to the problem of aggregating ltiple claims from various sources in order to estimate the plausibility of atements about entities. In the absence of a ground truth, this problem is highly challenging, when some sources provide conflicting claims and others no claims at all. In this work we provide an unsupervised stochastic inference procedure on top of a model that combines restricted Boltzmann machines with feed-forward neural networks to accurately infer the reliability of sources as well as the plausibility of statements about entities. In comparison to prior work our approach stands out (1) by allowing the incorporation of arbitrary features about sources and claims, (2) by generalizing from reliability per source towards a reliability function, and thus (3) enabling the estimation of source reliability even for sources that have provided no or very few claims, (4) by building on efficient and scalable stochastic inference algorithms, and (5) by outperforming the state-of-the-art by a considerable margin.

artificial intelligence, bayesian inference, machine learning, (17 more...)

1807.1068

Country:

Europe > Germany > Hesse > Darmstadt Region > Wiesbaden (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Illinois (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)
(2 more...)

ScienceJul-26-2018, 18:24:58 GMT

Response to Comment on "An excess of massive stars in the local 30 Doradus starburst"

Farr and Mandel reanalyze our data, finding initial mass function slopes for high-mass stars in 30 Doradus that agree with our results. However, their reanalysis appears to underpredict the observed number of massive stars. Their technique results in more precise slopes than in our work, strengthening our conclusion that there is an excess of massive stars ( 30 solar masses) in 30 Doradus. Farr and Mandel (1) reanalyzed the results of our study (2), in which we investigated the star formation history (SFH) and stellar initial mass function (IMF) of the local 30 Doradus (30 Dor) starburst in the Large Magellanic Cloud and found an overabundance of stars with initial mass exceeding 30 solar masses (M). They use an alternative and potentially more powerful statistical framework, hierarchical Bayesian inference, and infer IMF power-law indices for massive stars that are in agreement with our results (compare the IMF slope distributions in their figure 1 to the 1σ range inferred in our analysis).

artificial intelligence, bayesian inference, massive star, (12 more...)

Science

Genre: Research Report > New Finding (0.76)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.35)