AITopics | Markov Models

Collaborating Authors

Markov Models

News Overviews Instructional Materials AI-Alerts Classics

On the Geometry of Message Passing Algorithms for Gaussian Reciprocal Processes

arXiv.org Machine LearningApr-6-2016

Reciprocal processes are acausal generalizations of Markov processes introduced by Bernstein in 1932. In the literature, a significant amount of attention has been focused on developing dynamical models for reciprocal processes. Recently, probabilistic graphical models for reciprocal processes have been provided. This opens the way to the application of efficient inference algorithms in the machine learning literature to solve the smoothing problem for reciprocal processes. Such algorithms are known to converge if the underlying graph is a tree. This is not the case for a reciprocal process, whose associated graphical model is a single loop network. The contribution of this paper is twofold. First, we introduce belief propagation for Gaussian reciprocal processes. Second, we establish a link between convergence analysis of belief propagation for Gaussian reciprocal processes and stability theory for differentially positive systems.

artificial intelligence, machine learning, reciprocal process, (14 more...)

arXiv.org Machine Learning

1603.09279

Country: Europe > United Kingdom (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (0.59)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)

Add feedback

Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models

Serban, Iulian V., Sordoni, Alessandro, Bengio, Yoshua, Courville, Aaron, Pineau, Joelle

arXiv.org Artificial IntelligenceApr-6-2016

We investigate the task of building open domain, conversational dialogue systems based on large dialogue corpora using generative models. Generative models produce system responses that are autonomously generated word-by-word, opening up the possibility for realistic, flexible interactions. In support of this goal, we extend the recently proposed hierarchical recurrent encoder-decoder neural network to the dialogue domain, and demonstrate that this model is competitive with state-of-the-art neural language models and back-off n-gram models. We investigate the limitations of this and similar approaches, and show how its performance can be improved by bootstrapping the learning from a larger question-answer pair corpus and from pretrained word embeddings.

machine learning, natural language, utterance, (20 more...)

arXiv.org Artificial Intelligence

1507.04808

Country: North America > Canada (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

"The Five Tribes of Machine Learning (And What You Can Learn from Each)," Pedro Domingos

#artificialintelligenceApr-5-2016, 13:01:05 GMT

There are five main schools of thought in machine learning, and each has its own master algorithm – a general-purpose learner that can in principle be applied to any domain. The symbolists have inverse deduction, the connectionists have backpropagation, the evolutionaries have genetic programming, the Bayesians have probabilistic inference, and the analogizers have support vector machines. What we really need, however, is a single algorithm combining the key features of all of them. In this webinar I will summarize the five paradigms and describe my work toward unifying them, including in particular Markov logic networks. I will conclude by speculating on the new applications that a universal learner will enable, and how society will change as a result.

artificial intelligence, machine learning, pedro domingo, (10 more...)

#artificialintelligence

Country: North America > United States > California (0.06)

Genre: Personal > Honors > Award (0.34)

Industry: Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.38)

Add feedback

Jiaconda/Home-Security

#artificialintelligenceApr-5-2016, 01:06:28 GMT

The first step to before being able to do Image or audio analyses would be to extract relevant frames from the video streams in real time. This is crucial to a smart interactive device and requires extensive down sizing of the data to run the models on the features identified most relevant. One also needs the device to identify and react to certain events (owner coming home, break-in etc) through a frame by frame comparative analysis. Let us start with the event that there is a disturbance and the image frames and audio data is fed into the trained model to classify the event into pre-defined classes (simplest cast being intrusion vs non intrusion). Given a frame, let us start with the features that we would extract from it to first look for faces within the scenario and then if we find one, to match it with the available "registered" face repository. Given pictures of the home-owner/family, we will have an extensively pre trained model.

artificial intelligence, classification, machine learning, (15 more...)

#artificialintelligence

Industry: Information Technology > Smart Houses & Appliances (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.31)

Add feedback

A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

Ji, Yangfeng, Haffari, Gholamreza, Eisenstein, Jacob

arXiv.org Machine LearningApr-5-2016

This paper presents a novel latent variable recurrent neural network architecture for jointly modeling sequences of words and (possibly latent) discourse relations between adjacent sentences. A recurrent neural network generates individual words, thus reaping the benefits of discriminatively-trained vector representations. The discourse relations are represented with a latent variable, which can be predicted or marginalized, depending on the task. The resulting model can therefore employ a training objective that includes not only discourse relation classification, but also word prediction. As a result, it outperforms state-of- the-art alternatives for two tasks: implicit discourse relation classification in the Penn Discourse Treebank, and dialog act classification in the Switchboard corpus. Furthermore, by marginalizing over latent discourse relations at test time, we obtain a discourse informed language model, which improves over a strong LSTM baseline.

artificial intelligence, discourse relation, machine learning, (16 more...)

arXiv.org Machine Learning

1603.01913

Country:

Europe (0.93)
North America > United States (0.68)
Asia (0.68)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.94)

Add feedback

K-Means Clustering - Lazy Programmer

#artificialintelligenceMar-25-2016, 11:40:32 GMT

K-means clustering is one of the simplest clustering algorithms one can use to find natural groupings of an unlabeled data set. Another way of stating this is that k-means clustering is an unsupervised learning algorithm. "learning the structure of X without being given Y". K-means clustering finds "k" different means (surprise surprise) which represent the centers of k clusters and assigns each data point to one of these clusters. The cluster it is assigned to is the one where the distance (usually Euclidean) from the point to the mean is smallest.

algorithm, artificial intelligence, machine learning, (5 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.35)

Add feedback

Quadratization and Roof Duality of Markov Logic Networks

de Nijs, Roderick Sebastiaan, Landsiedel, Christian, Wollherr, Dirk, Buss, Martin

Journal of Artificial Intelligence ResearchMar-25-2016

This article discusses the quadratization of Markov Logic Networks, which enables efficient approximate MAP computation by means of maximum flows. The procedure relies on a pseudo-Boolean representation of the model, and allows handling models of any order. The employed pseudo-Boolean representation can be used to identify problems that are guaranteed to be solvable in low polynomial-time. Results on common benchmark problems show that the proposed approach finds optimal assignments for most variables in excellent computational time and approximate solutions that match the quality of ILP-based solvers.

parfactor, pseudo-boolean function, quadratization, (13 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.5023

AI Access Foundation

10991

Journal of Artificial Intelligence Research

Country:

Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)
North America > United States > Illinois (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Germany > Saxony-Anhalt > Magdeburg (0.04)

Genre:

Research Report (0.48)
Overview (0.46)
Workflow (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Is deep learning a Markov chain in disguise?

@machinelearnbotMar-24-2016, 23:38:45 GMT

Andrej Karpathy's post "The Unreasonable Effectiveness of Recurrent Neural Networks" made splashes last year. The basic premise is that you can create a recurrent neural network to learn language features character-by-character. But is the resultant model any different from a Markov chain built for the same purpose? I implemented a character-by-character Markov chain in R to find out. First, let's play a variation of the Imitation Game with generated text from Karpathy's tinyshakespeare dataset.

artificial intelligence, machine learning, markov chain, (8 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Markov Chain Monte Carlo for Bayesian Inference - The Metropolis Algorithm - QuantStart

#artificialintelligenceMar-24-2016, 21:45:38 GMT

In previous discussions of Bayesian Inference we introduced Bayesian Statistics and considered how to infer a binomial proportion using the concept of conjugate priors. We discussed the fact that not all models can make use of conjugate priors and thus calculation of the posterior distribution would need to be approximated numerically. In this article we introduce the main family of algorithms, known collectively as Markov Chain Monte Carlo (MCMC), that allow us to approximate the posterior distribution as calculated by Bayes' Theorem. In particular, we consider the Metropolis Algorithm, which is easily stated and relatively straightforward to understand. It serves as a useful starting point when learning about MCMC before delving into more sophisticated algorithms such as Metropolis-Hastings, Gibbs Samplers and Hamiltonian Monte Carlo. Once we have described how MCMC works, we will carry it out using the open-source PyMC3 library, which takes care of many of the underlying implementation details, allowing us to concentrate on Bayesian modelling.

artificial intelligence, bayesian inference, machine learning, (12 more...)

#artificialintelligence

Industry: Banking & Finance (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.62)

Add feedback

Generating Text Using a Markov Model

@machinelearnbotMar-24-2016, 09:35:18 GMT

The generate method takes in a conditional frequency distribution. Think – how many times did each word appear after'farm'? That is what a conditional frequency distribution outputs (for all words, not just'farm'). The rest of the generate function does is output text based on the distribution observed in the training data. I did this by making an array with each word that appeared after the current word.

artificial intelligence, generating text, machine learning, (2 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.40)

Add feedback