7 Supplementary Material

7.1 Conditional Kalman Filter equations

As mentioned in Sec. 2.3, the first term in Eq

Neural Information Processing Systems 

For all datasets, the same model and training hyper-parameters (except learning rate) were used.