Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation