Beyond Lipreading: Visual Speech Recognition Looks You in the Eye
Like the lipreading spies of yesteryear peering through their binoculars, almost all visual speech recognition (VSR) research today focuses on mouth and lip motion. But a new study suggests that VSR models could perform even better if they exploited additional visual information that is already available.

The VSR field typically looks at the mouth region because lip shape and motion are believed to contain almost all the information correlated with speech. As a result, information from other facial regions has been regarded as weak by default. A new paper from the Key Laboratory of Intelligent Information Processing of the Chinese Academy of Sciences and the University of Chinese Academy of Sciences, however, proposes that information from extraoral facial regions can consistently improve the performance of state-of-the-art (SOTA) VSR models.
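To make the contrast concrete, here is a minimal sketch of the two input choices the article describes: the conventional mouth-only region-of-interest (ROI) crop versus a full-face input that retains extraoral regions. The frame size, crop size, and landmark position are hypothetical placeholders, not values from the paper.

```python
import numpy as np

def crop_roi(frame, center, size):
    """Crop a square region of interest around a landmark center (row, col)."""
    half = size // 2
    cy, cx = center
    return frame[cy - half:cy + half, cx - half:cx + half]

# Dummy 128x128 grayscale face frame standing in for one video frame.
frame = np.zeros((128, 128), dtype=np.float32)

# Hypothetical landmark: mouth center in the lower third of the face.
mouth_center = (96, 64)

# Conventional VSR input: a tight crop around the mouth.
mouth_roi = crop_roi(frame, mouth_center, 48)

# Input that keeps extraoral regions: the whole face frame.
full_face = frame

print(mouth_roi.shape)   # (48, 48)
print(full_face.shape)   # (128, 128)
```

In a real pipeline the landmark would come from a face-alignment model and the crop would be applied per frame across the video; the point here is only that the full-face input preserves regions (cheeks, jaw, eyes) that the mouth-only crop discards.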
Mar-27-2020, 12:34:34 GMT