A Comparison of Discrete Latent Variable Models for Speech Representation Learning