A Simple Explanation of A Spectral Algorithm for Learning Hidden Markov Models
This document is my summary explanation of the algorithm in "A Spectral Algorithm for Learning Hidden Markov Models" (COLT 2009), though there may be some slight notational inconsistencies with the original paper. The exposition and the math here are quite different, so if you don't like this explanation, try the original paper! The idea is to maintain output predictions in a recursive inference algorithm, instead of the usual method of maintaining hidden state predictions, and to represent the HMM only in terms of the maps necessary to update output predictions given new data. This approach limits the inference computations the algorithm can perform (it can't answer any queries about the hidden states since it doesn't explicitly deal with them at all), but it also reduces the complexity of the model parameters that are learned and thus makes learning easier. The learning algorithm uses an SVD and matrix operations, so it avoids the local-optima problems of EM or any other algorithms based on maximizing data likelihood over the usual HMM parameterization.
Apr-11-2012