Supplementary Materials

Neural Information Processing Systems 

Finally, the data was subsampled by a factor of 2. Data augmentation TX features were augmented by adding two types of artificial noise. Subsequently, random constant offsets (mean = 0 std = 0.6) were added to the means of the Each session day has its own affine transform layer. RNN training hyperparameters The hyperparameters for RNN training are listed in Table 1. It used a 130,000 word vocabulary taken from the CMU Pronouncing Dictionary [1]. Out-of-vocabulary words were mapped to a special token.