Jointly Reinforcing Diversity and Quality in Language Model Generations