Google's AI can now lip read better than humans after watching thousands of hours of TV

May-25-2017, 18:50:10 GMT–#artificialintelligence

The research follows similar work published by a separate group at the University of Oxford earlier this month. Using related techniques, these scientists were able to create a lip-reading program called LipNet that achieved 93.4 percent accuracy in tests, compared to 52.3 percent human accuracy. However, LipNet was only tested on specially-recorded footage that used volunteers speaking formulaic sentences. By comparison, DeepMind's software -- known as "Watch, Listen, Attend, and Spell" -- was tested on far more challenging footage; transcribing natural, unscripted conversations from BBC politics shows. More than 5,000 hours of footage from TV shows including Newsnight, Question Time, and the World Today, was used to train DeepMind's "Watch, Listen, Attend, and Spell" program.

deepmind, footage, lip read, (5 more...)

#artificialintelligence

May-25-2017, 18:50:10 GMT

News Web Page

Add feedback

Country:
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)

Industry:
- Media (0.41)
- Leisure & Entertainment (0.41)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.55)
  - Machine Learning > Neural Networks
    - Deep Learning (0.55)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found