Improving Diversity in Language Models: When Temperature Fails, Change the Loss