Read my lips: New technology spells out what's said when audio fails - Press Release - UEA
New lip-reading technology developed at the University of East Anglia could help in solving crimes and provide communication assistance for people with hearing and speech impairments. The visual speech recognition technology, created by Dr Helen L. Bear and Prof Richard Harvey of UEA's School of Computing Sciences, can be applied "any place where the audio isn't good enough to determine what people are saying," Dr Bear said. Dr Bear, whose findings will be presented at the International Conference on Acoustics, Speech and Signal Processing (ICASSP) in Shanghai on March 25, said unique problems with determining speech arise when sound isn't available – such as on CCTV footage – or if the audio is inadequate and there aren't clues to give the context of a conversation. The sounds '/p/,' '/b/,' and '/m/' all look similar on the lips, but now the machine lip-reading classification technology can differentiate between the sounds for a more accurate translation. Dr Bear said: "We are still learning the science of visual speech and what it is people need to know to create a fool-proof recognition model for lip-reading, but this classification system improves upon previous lip-reading methods by using a novel training method for the classifiers. "Potentially, a robust lip-reading system could be applied in a number of situations, from criminal investigations to entertainment.
Mar-27-2016, 02:05:27 GMT
- Genre:
- Press Release (0.40)
- Industry:
- Education (0.58)
- Technology:
- Information Technology > Artificial Intelligence
- Speech (0.61)
- Machine Learning (0.38)
- Information Technology > Artificial Intelligence