Norm-guided latent space exploration for text-to-image generation