Goto

Collaborating Authors

 Markov Models






FlowHMM: Flow-based continuous hidden Markov models

Neural Information Processing Systems

Continuous hidden Markov models (HMMs) assume that observations are generated from a mixture of Gaussian densities, limiting their ability to model more complex distributions.



Tractable Latent State Inference for Hidden Continuous-Time semi-Markov Chains Supplement

Neural Information Processing Systems

We will first replicate an equation similar to (20) for the backward case. The derivation is similar to that of the forward equation, so that it uses a combination of equations (16), (18) and (19) while leaving out the observation likelihood function. The combination is again carried out using the Laplace transform.



A Provably Efficient Sample Collection Strategy for Reinforcement Learning

Neural Information Processing Systems

One of the challenges in online reinforcement learning (RL) is that the agent needs to trade off the exploration of the environment and the exploitation of the samples to optimize its behavior. Whether we optimize for regret, sample complexity, state-space coverage or model estimation, we need to strike a different exploration-exploitation trade-off.


A Provably Efficient Sample Collection Strategy for Reinforcement Learning

Neural Information Processing Systems

One of the challenges in online reinforcement learning (RL) is that the agent needs to trade off the exploration of the environment and the exploitation of the samples to optimize its behavior. Whether we optimize for regret, sample complexity, state-space coverage or model estimation, we need to strike a different exploration-exploitation trade-off.