AI chatbots miss urgent issues in queries about women's health

New Scientist 

AI chatbots miss urgent issues in queries about women's health AI models such as ChatGPT and Gemini fail to give adequate advice for 60 per cent of queries relating to women's health in a test created by medical professionals Many women are using AI for health information, but the answers aren't always up to scratch Commonly used AI models fail to accurately diagnose or offer advice for many queries relating to women's health that require urgent attention. Thirteen large language models, produced by the likes of OpenAI, Google, Anthropic, Mistral AI and xAI, were given 345 medical queries across five specialities, including emergency medicine, gynaecology and neurology. The queries were written by 17 women's health researchers, pharmacists and clinicians from the US and Europe. The answers were reviewed by the same experts. Any questions that the models failed at were collated into a benchmarking test of AI models' medical expertise that included 96 queries.