Training-Free Consistent Text-to-Image Generation