Representation learning for neural population activity with Neural Data Transformers