Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization