AITopics | visual speechreading

Collaborating Authors

visual speechreading

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Dynamic Features for Visual Speechreading: A Systematic Comparison

Neural Information Processing SystemsApr-6-2023, 18:11:39 GMT

Humans use visual as well as auditory speech signals to recognize spoken words. A variety of systems have been investigated for per(cid:173) forming this task. The main purpose of this research was to sys(cid:173) tematically compare the performance of a range of dynamic visual features on a speechreading task. We have found that normal(cid:173) ization of images to eliminate variation due to translation, scale, and planar rotation yielded substantial improvements in general(cid:173) ization performance regardless of the visual representation used. In addition, the dynamic information in the difference between suc(cid:173) cessive frames yielded better performance than optical-flow based approaches, and compression by local low-pass filtering worked sur(cid:173) prisingly better than global principal components analysis (PCA).

dynamic feature, systematic comparison, visual speechreading, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Vision (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.30)

Add feedback

Dynamic Features for Visual Speechreading: A Systematic Comparison

Gray, Michael S., Movellan, Javier R., Sejnowski, Terrence J.

Neural Information Processing SystemsDec-31-1997

Humans use visual as well as auditory speech signals to recognize spoken words. A variety of systems have been investigated for performing this task. The main purpose of this research was to systematically compare the performance of a range of dynamic visual features on a speechreading task. We have found that normalization of images to eliminate variation due to translation, scale, and planar rotation yielded substantial improvements in generalization performance regardless of the visual representation used. In addition, the dynamic information in the difference between successive frames yielded better performance than optical-flow based approaches, and compression by local low-pass filtering worked surprisingly better than global principal components analysis (PCA). These results are examined and possible explanations are explored.

information, representation, visual information, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Diego County > San Diego (0.07)
North America > United States > California > San Diego County > La Jolla (0.05)

Technology:

Information Technology > Artificial Intelligence > Vision (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.33)

Add feedback

Dynamic Features for Visual Speechreading: A Systematic Comparison

Gray, Michael S., Movellan, Javier R., Sejnowski, Terrence J.

Neural Information Processing SystemsDec-31-1997

information, representation, visual information, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Diego County > San Diego (0.07)
North America > United States > California > San Diego County > La Jolla (0.05)

Technology:

Information Technology > Artificial Intelligence > Vision (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.33)

Add feedback

Dynamic Features for Visual Speechreading: A Systematic Comparison

Gray, Michael S., Movellan, Javier R., Sejnowski, Terrence J.

Neural Information Processing SystemsDec-31-1997

Humans use visual as well as auditory speech signals to recognize spoken words. A variety of systems have been investigated for performing thistask. The main purpose of this research was to systematically comparethe performance of a range of dynamic visual features on a speechreading task. We have found that normalization ofimages to eliminate variation due to translation, scale, and planar rotation yielded substantial improvements in generalization performanceregardless of the visual representation used. In addition, the dynamic information in the difference between successive framesyielded better performance than optical-flow based approaches, and compression by local low-pass filtering worked surprisingly betterthan global principal components analysis (PCA). These results are examined and possible explanations are explored.

artificial intelligence, information, machine learning, (15 more...)

Neural Information Processing Systems

Country: North America > United States > California > San Diego County (0.18)

Technology:

Information Technology > Artificial Intelligence > Vision (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.33)

Add feedback