What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability