AITopics | Markov Models

Collaborating Authors

Markov Models

News Overviews Instructional Materials AI-Alerts Classics

MIT OpenCourseWare Electrical Engineering and Computer Science 6.881 Natural Language Processing, Fall 2004

AITopics Original LinksJan-18-2017, 10:04:57 GMT

The class will cover models at the level of syntactic, semantic and discourse processing. The emphasis will be on corpus-based methods and algorithms, such as Hidden Markov Models and probabilistic context free grammars. We will discuss the use of these methods and models in a variety of applications including syntactic parsing, information extraction, statistical machine translation, and summarization. File decompression software, such as Winzip or StuffIt, is required to open the .gz Postscript viewer software, such as Ghostscript/Ghostview, can be used to view the .ps

artificial intelligence, machine learning, mit opencourseware electrical engineering, (5 more...)

AITopics Original Links

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

CSDL - IEEE Intelligent Systems - Table of Contents

AITopics Original LinksJan-18-2017, 10:04:02 GMT

bioinformatics, ieee intelligent system, machine learning, (2 more...)

AITopics Original Links

Genre: Collection (0.60)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.87)

Technology:

Information Technology > Biomedical Informatics > Translational Bioinformatics (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.59)

Add feedback

A Survey of Robotic Musicianship

AITopics Original LinksJan-18-2017, 10:02:59 GMT

Using humanoid robots to study human behavior.

artificial intelligence, machine learning, simulation of human behavior, (17 more...)

AITopics Original Links

Country:

Asia > Singapore (0.04)
North America > United States > Massachusetts (0.04)
Europe > Slovenia > Central Slovenia > Municipality of Ljubljana > Ljubljana (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)

Genre: Overview (0.50)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.46)
Information Technology > Artificial Intelligence > Cognitive Science > Simulation of Human Behavior (0.46)

Add feedback

Converting Cascade-Correlation Neural Nets into Probabilistic Generative Models

Nobandegani, Ardavan Salehi, Shultz, Thomas R.

arXiv.org Machine LearningJan-18-2017

Humans are not only adept in recognizing what class an input instance belongs to (i.e., classification task), but perhaps more remarkably, they can imagine (i.e., generate) plausible instances of a desired class with ease, when prompted. Inspired by this, we propose a framework which allows transforming Cascade-Correlation Neural Networks (CCNNs) into probabilistic generative models, thereby enabling CCNNs to generate samples from a category of interest. CCNNs are a well-known class of deterministic, discriminative NNs, which autonomously construct their topology, and have been successful in giving accounts for a variety of psychological phenomena. Our proposed framework is based on a Markov Chain Monte Carlo (MCMC) method, called the Metropolis-adjusted Langevin algorithm, which capitalizes on the gradient information of the target distribution to direct its explorations towards regions of high probability, thereby achieving good mixing properties. Through extensive simulations, we demonstrate the efficacy of our proposed framework.

artificial intelligence, category, machine learning, (19 more...)

arXiv.org Machine Learning

1701.05004

Country: North America > United States (0.28)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Add feedback

5 things you need to know about A.I.: Cognitive, neural and deep, oh my!

#artificialintelligenceJan-13-2017, 05:25:16 GMT

There's never any shortage of buzzwords in the IT world, but when it comes to A.I., they can be hard to tell apart. Here are five things you need to understand. Artificial intelligence refers to "a broad set of methods, algorithms and technologies that make software'smart' in a way that may seem human-like to an outside observer," said Lynne Parker, director of the division of Information and Intelligent Systems for the National Science Foundation. Machine learning, computer vision, natural language processing, robotics and related topics are all part of A.I., in other words. "Some people may come up with distinctions between the two, but there is not a universal view that the two terms mean anything different," Parker said.

artificial intelligence, learning, machine learning, (12 more...)

#artificialintelligence

Country:

North America > United States > Oregon (0.05)
North America > Canada (0.05)
Europe (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.31)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.31)

Add feedback

Bayesian Non-Homogeneous Markov Models via Polya-Gamma Data Augmentation with Applications to Rainfall Modeling

Holsclaw, Tracy, Greene, Arthur M., Robertson, Andrew W., Smyth, Padhraic

arXiv.org Machine LearningJan-12-2017

Discrete-time hidden Markov models are a broadly useful class of latent-variable models with applications in areas such as speech recognition, bioinformatics, and climate data analysis. It is common in practice to introduce temporal non-homogeneity into such models by making the transition probabilities dependent on time-varying exogenous input variables via a multinomial logistic parametrization. We extend such models to introduce additional non-homogeneity into the emission distribution using a generalized linear model (GLM), with data augmentation for sampling-based inference. However, the presence of the logistic function in the state transition model significantly complicates parameter inference for the overall model, particularly in a Bayesian context. To address this we extend the recently-proposed Polya-Gamma data augmentation approach to handle non-homogeneous hidden Markov models (NHMMs), allowing the development of an efficient Markov chain Monte Carlo (MCMC) sampling scheme. We apply our model and inference scheme to 30 years of daily rainfall in India, leading to a number of insights into rainfall-related phenomena in the region. Our proposed approach allows for fully Bayesian analysis of relatively complex NHMMs on a scale that was not possible with previous methods. Software implementing the methods described in the paper is available via the R package NHMM.

artificial intelligence, machine learning, rainfall, (18 more...)

arXiv.org Machine Learning

1701.02856

Country:

Asia (0.89)
North America > United States > California (0.28)

Genre: Research Report (1.00)

Industry:

Energy (0.93)
Government > Regional Government > North America Government > United States Government (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Improving Sampling from Generative Autoencoders with Markov Chains

Creswell, Antonia, Arulkumaran, Kai, Bharath, Anil Anthony

arXiv.org Machine LearningJan-12-2017

We focus on generative autoencoders, such as variational or adversarial autoencoders, which jointly learn a generative model alongside an inference model. Generative autoencoders are those which are trained to softly enforce a prior on the latent distribution learned by the inference model. We call the distribution to which the inference model maps observed samples, the learned latent distribution, which may not be consistent with the prior. We formulate a Markov chain Monte Carlo (MCMC) sampling process, equivalent to iteratively decoding and encoding, which allows us to sample from the learned latent distribution. Since, the generative model learns to map from the learned latent distribution, rather than the prior, we may use MCMC to improve the quality of samples drawn from the generative model, especially when the learned latent distribution is far from the prior. Using MCMC sampling, we are able to reveal previously unseen differences between generative autoencoders trained either with or without a denoising criterion.

artificial intelligence, autoencoder, machine learning, (17 more...)

arXiv.org Machine Learning

1610.09296

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.62)

Add feedback

Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks

Zhang, Ying, Pezeshki, Mohammad, Brakel, Philemon, Zhang, Saizheng, Bengio, Cesar Laurent Yoshua, Courville, Aaron

arXiv.org Machine LearningJan-10-2017

Convolutional Neural Networks (CNNs) are effective models for reducing spectral variations and modeling spectral correlations in acoustic features for automatic speech recognition (ASR). Hybrid speech recognition systems incorporating CNNs with Hidden Markov Models/Gaussian Mixture Models (HMMs/GMMs) have achieved the state-of-the-art in various benchmarks. Meanwhile, Connectionist Temporal Classification (CTC) with Recurrent Neural Networks (RNNs), which is proposed for labeling unsegmented sequences, makes it feasible to train an'end-to-end' speech recognition system instead of hybrid settings. However, RNNs are computationally expensive and sometimes difficult to train. In this paper, inspired by the advantages of both CNNs and the CTC approach, we propose an end-to-end speech framework for sequence labeling, by combining hierarchical CNNs with CTC directly without recurrent connections. By evaluating the approach on the TIMIT phoneme recognition task, we show that the proposed model is not only computationally efficient, but also competitive with the existing baseline systems. Moreover, we argue that CNNs have the capability to model temporal correlations with appropriate context information.

artificial intelligence, machine learning, sequence, (14 more...)

arXiv.org Machine Learning

1701.0272

Country: North America > Canada > Quebec (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Artificial intelligence

#artificialintelligenceJan-9-2017, 15:10:04 GMT

Major AI researchers and textbooks define the field as "the study and design of intelligent agents", where an intelligent agent is a system that perceives its environment and takes actions that maximize its chances of success. John McCarthy, who coined the term in 1955, defines it as "The science and engineering of making intelligent machines". AI research is highly technical and specialized, deeply divided into subfields that often fail to communicate with each other. Some of the division is due to social and cultural factors: subfields have grown up around particular institutions and the work of individual researchers. AI research is also divided by several technical issues.

different approach measure machine intelligence, machine learning, management topic artificial intelligence submitted, (17 more...)

#artificialintelligence

Country:

North America > United States > New York (0.04)
North America > United States > New Jersey (0.04)
Europe > Greece (0.04)
(7 more...)

Industry:

Media > Film (1.00)
Information Technology (1.00)
Health & Medicine > Therapeutic Area (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > History (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Learning Sparse Structural Changes in High-dimensional Markov Networks: A Review on Methodologies and Theories

Liu, Song, Fukumizu, Kenji, Suzuki, Taiji

arXiv.org Machine LearningJan-9-2017

For example, genes may regulate each other in different ways when external conditions are changed; the number of daily flu-like symptom reports in nearby hospitals may become correlated when a major epidemic disease breaks out; EEG signals from different regions of the brain may be synchronized/desynchronized when the subject is performing different activities. Spotting such changes in interactions may provide key insights into the underlying system. The interactions among random variables can be formulated as undirected probabilistic graphical models, or Markov Networks (MNs) [Koller and Friedman, 2009], expressing the interactions via the conditional independence. We consider a simple model: the pairwise MNs where the links are only encoded for single or pairs of random variables. Due to the Hammersley-Clifford theorem [Hammersley and Clifford, 1971], the underlying joint probability density function can be represented as the product of univariate and bivariate factors.

artificial intelligence, kliep, machine learning, (17 more...)

arXiv.org Machine Learning

1701.01582

Country: Europe > Austria (0.28)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.71)

Add feedback