Global Autoregressive Models for Data-Efficient Sequence Learning