Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders