Minimax Learning of Ergodic Markov Chains
Wolfer, Geoffrey, Kontorovich, Aryeh
We compute the finite-sample minimax (modulo logarithmic factors) sample complexity of learning the parameters of a finite Markov chain from a single long sequence of states. Our error metric is a natural variant of total variation. The sample complexity necessarily depends on the spectral gap and minimal stationary probability of the unknown chain - for which, at least in the reversible case, there are known finite-sample estimators with fully empirical confidence intervals. To our knowledge, this is the first PAC-type result with nearly matching (up to logs) upper and lower bounds for learning, in any metric in the context of Markov chains.
Sep-13-2018
- Country:
- North America
- United States > New York (0.04)
- Canada > Quebec
- Montreal (0.04)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- France > Île-de-France
- United Kingdom > England
- Asia > Middle East
- Israel > Southern District > Beer-Sheva (0.04)
- North America
- Genre:
- Research Report (0.50)