Jointly Optimizing Diversity and Relevance in Neural Response Generation