If AI image generators are so smart, why do they struggle to write and count?
AI image produced using the prompt'hyper-realistic ten hands on a picture with text saying hello'. Generative AI tools such as Midjourney, Stable Diffusion and DALL-E 2 have astounded us with their ability to produce remarkable images in a matter of seconds. Despite their achievements, however, there remains a puzzling disparity between what AI image generators can produce and what we can. For instance, these tools often won't deliver satisfactory results for seemingly simple tasks such as counting objects and producing accurate text. If generative AI has reached such unprecedented heights in creative expression, why does it struggle with tasks even a primary school student could complete? Exploring the underlying reasons helps sheds light on the complex numerical nature of AI, and the nuance of its capabilities.
Jul-17-2023, 09:55:53 GMT