AITopics | speech-generation breakthrough

Collaborating Authors

speech-generation breakthrough

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Google's AI Brainiacs Achieve Speech-Generation Breakthrough

#artificialintelligenceSep-12-2016, 12:05:22 GMT

WaveNet won't have immediate commercial applications because the system requires too much computational power: it has to sample the audio signal it is being trained on 16,000 times per second or more, DeepMind said. And then for each of those samples it has to form a prediction about what the soundwave should look like based on each of the prior samples. Even the DeepMind researchers acknowledged in their blog post that this "is a clearly challenging task."

artificial intelligence, machine learning, speech-generation breakthrough, (4 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Google's DeepMind Achieves Speech-Generation Breakthrough

#artificialintelligenceSep-9-2016, 14:05:17 GMT

Google's DeepMind unit, which is working to develop super-intelligent computers, has created a system for machine-generated speech that it says outperforms existing technology by 50 percent. U.K.-based DeepMind, which Google acquired for about 400 million pounds ( 533 million) in 2014, developed an artificial intelligence called WaveNet that can mimic human speech by learning how to form the individual sound waves a human voice creates, it said in a blog post Friday. In blind tests for U.S. English and Mandarin Chinese, human listeners found WaveNet-generated speech sounded more natural than that created with any of Google's existing text-to-speech programs, which are based on different technologies. WaveNet still underperformed recordings of actual human speech. Many computer-generated speech programs work by using a large data set of short recordings of a single human speaker and then combining these speech fragments to form new words.

large language model, machine learning, natural language, (10 more...)

#artificialintelligence

Industry:

Leisure & Entertainment > Games (0.33)
Information Technology > Services (0.33)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.98)

Add feedback