speech recognition


Tools for Making Machine Learning Easier and Smoother

#artificialintelligence

The computer vision, speech recognition, natural language processing, and audio recognition applications being developed with DL techniques need significant computational power to process very large volumes of data. There are three types of ML: supervised learning, unsupervised learning, and reinforcement learning. One notable example is Google DeepMind, which used DL techniques in AlphaGo, a computer program developed to play the board game Go. Using one of the world's most popular computer games, the project's developers are creating a research environment open to artificial intelligence and machine learning researchers around the world.


How to use Watson Speech to Text utilities to increase accuracy - Artificial Intelligence

@machinelearnbot

I thought I would take a moment to play with Watson Speech to Text and a utility that was released a few months ago. The purpose of asking about a puppy is that I have a sample conversation system about buying a dog. Learn how to use the Watson Speech to Text API to increase your accuracy; we've included download links for the S2T utilities and a sample .wav file.
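
For readers who want to try the same thing, here is a minimal sketch of transcribing a .wav file with the Watson Speech to Text service from Python. It assumes the ibm-watson SDK and IAM authentication; the API key, service URL, model name, and audio file name are placeholders rather than anything from the article.

```python
# Minimal sketch: transcribe a .wav file with Watson Speech to Text.
# Assumes the `ibm-watson` Python SDK and valid IAM credentials;
# the API key, service URL, and audio file path are placeholders.
from ibm_watson import SpeechToTextV1
from ibm_cloud_sdk_core.authenticators import IAMAuthenticator

authenticator = IAMAuthenticator("YOUR_API_KEY")
speech_to_text = SpeechToTextV1(authenticator=authenticator)
speech_to_text.set_service_url("https://api.us-south.speech-to-text.watson.cloud.ibm.com")

with open("puppy_question.wav", "rb") as audio_file:
    result = speech_to_text.recognize(
        audio=audio_file,
        content_type="audio/wav",
        model="en-US_BroadbandModel",
    ).get_result()

# Print the top transcript for each recognized utterance.
for chunk in result["results"]:
    print(chunk["alternatives"][0]["transcript"])
```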


Inside the black box: Understanding AI decision-making - ZDNet

#artificialintelligence

Neural networks, machine-learning systems, predictive analytics, speech recognition, natural-language understanding and other components of what's broadly defined as 'artificial intelligence' (AI) are currently undergoing a boom: research is progressing apace, media attention is at an all-time high, and organisations are increasingly implementing AI solutions in pursuit of automation-driven efficiencies. Neural networks are a particular concern not only because they are a key component of many AI applications -- including image recognition, speech recognition, natural language understanding and machine translation -- but also because they're something of a 'black box' when it comes to elucidating exactly how their results are generated. This 'black box' problem was addressed in a recent paper from MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL), which examined neural networks trained on text-based data using a system comprising two modules -- a 'generator' and an 'encoder'. Many people -- including Stephen Hawking, Elon Musk and leading AI researchers -- have expressed concerns about how AI might develop, leading to the creation of organisations like OpenAI and the Partnership on AI aimed at avoiding potential pitfalls.


Flipboard on Flipboard

#artificialintelligence

Dictate, a new project from Microsoft's experimental R&D group, Microsoft Garage, is launching today to offer a way to type using your voice in Office programs including Outlook, Word and PowerPoint. Available as an add-in for Microsoft's software, Dictate is powered by the same speech recognition technology that Cortana uses to convert your speech to text. An introductory video posted this morning to YouTube offers a preview of how the software works in Word, PowerPoint, and Outlook. At launch, it supports more than 20 languages for dictation and can translate in real time into 60 languages.


Finance is about to undergo an A.I. Revolution - ansarada

#artificialintelligence

Eurekahedge, a hedge fund database, said it monitored 23 hedge funds that rely on A.I. In March this year BlackRock, the world's biggest asset manager, said it was relying more on computers to pick stocks. Mr. Ferrucci previously led IBM's development of Watson, a computer system capable of answering questions posed in natural language.


Open Source Toolkits for Speech Recognition

@machinelearnbot

Typically, such systems consist of n-gram language models combined with hidden Markov models (HMMs). This article reviews the main options for free speech recognition toolkits that use traditional HMM and n-gram language models. Kaldi, however, covers both the phonetic and deep learning approaches to speech recognition. We didn't dig as deeply into the other three packages, but they all come with at least simple models or appear to be compatible with the format provided on the VoxForge site, a fairly active crowdsourced repository of speech recognition data and trained models.
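
To make the n-gram language model idea concrete, the toy Python sketch below estimates add-one-smoothed bigram probabilities from a tiny hand-made corpus and scores candidate word sequences; in a real recognizer such as the toolkits reviewed here, this language-model score is combined with HMM acoustic likelihoods during decoding. The corpus and smoothing choice are illustrative assumptions, not taken from the article.

```python
# Toy sketch of the n-gram (here, bigram) language model idea used by
# traditional HMM-based recognizers to score candidate word sequences.
# The corpus and add-one smoothing are illustrative assumptions.
from collections import Counter

corpus = [
    ["i", "want", "to", "buy", "a", "puppy"],
    ["i", "want", "a", "dog"],
    ["buy", "a", "dog"],
]

unigrams = Counter()
bigrams = Counter()
for sentence in corpus:
    tokens = ["<s>"] + sentence + ["</s>"]
    unigrams.update(tokens[:-1])
    bigrams.update(zip(tokens[:-1], tokens[1:]))

vocab_size = len(set(unigrams) | {"</s>"})

def bigram_prob(prev, word):
    # Add-one (Laplace) smoothed conditional probability P(word | prev).
    return (bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size)

def sentence_score(words):
    # Product of bigram probabilities; a recognizer would combine this
    # with the acoustic (HMM) likelihood when ranking hypotheses.
    score = 1.0
    tokens = ["<s>"] + words + ["</s>"]
    for prev, word in zip(tokens[:-1], tokens[1:]):
        score *= bigram_prob(prev, word)
    return score

print(sentence_score(["i", "want", "a", "puppy"]))   # fluent word order
print(sentence_score(["puppy", "a", "want", "i"]))   # scores much lower
```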


How to use Watson Speech to Text utilities to increase accuracy - Watson

#artificialintelligence

June 23, 2017 | Written by: Simon O'Doherty
Key points:
– Learn how to use the Watson Speech to Text utilities to increase your accuracy
– We've included links so you can download the S2T utilities and a sample .wav file

I thought I would take a moment to play with Watson Speech to Text and a utility that was released a few months ago. The Speech to Text Utils let you train S2T using your existing conversational system. The purpose of asking about a puppy is that I have a sample conversation system about buying a dog. In a demonstration from development, the utility increased a S2T model's accuracy from around 50% to over 80%.
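
Under the hood, utilities like this rely on Watson's language-model customization: you build a corpus of domain sentences (for example, utterances drawn from a conversation workspace), train a custom language model on it, and pass that model's ID when recognizing audio. The sketch below shows that flow with the ibm-watson Python SDK; the credentials, file names, service URL, and the assumption that this mirrors what the Speech to Text Utils do internally are mine, not the author's.

```python
# Sketch: customize Watson Speech to Text with domain sentences
# (e.g. utterances exported from a conversation workspace), then
# recognize audio against the customized model. Credentials, file
# names, and the service URL are placeholders/assumptions.
import time

from ibm_watson import SpeechToTextV1
from ibm_cloud_sdk_core.authenticators import IAMAuthenticator

stt = SpeechToTextV1(authenticator=IAMAuthenticator("YOUR_API_KEY"))
stt.set_service_url("https://api.us-south.speech-to-text.watson.cloud.ibm.com")

# 1. Create a custom language model on top of a base model.
custom = stt.create_language_model(
    name="dog-purchase-domain",
    base_model_name="en-US_BroadbandModel",
).get_result()
customization_id = custom["customization_id"]

# 2. Add a corpus of domain sentences (one utterance per line).
with open("dog_conversation_corpus.txt", "rb") as corpus_file:
    stt.add_corpus(customization_id, "dog-corpus", corpus_file)

# 3. Wait for corpus processing, train, and wait for training to finish.
while stt.get_language_model(customization_id).get_result()["status"] != "ready":
    time.sleep(10)
stt.train_language_model(customization_id)
while stt.get_language_model(customization_id).get_result()["status"] != "available":
    time.sleep(10)

# 4. Recognize audio against the customized model.
with open("buy_a_puppy.wav", "rb") as audio:
    result = stt.recognize(
        audio=audio,
        content_type="audio/wav",
        model="en-US_BroadbandModel",
        language_customization_id=customization_id,
    ).get_result()

print(result["results"][0]["alternatives"][0]["transcript"])
```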


Talking to your car is better than ever, thanks to Nuance's voice assistant

Mashable

Nuance Communications is the company responsible for the original voice recognition and speech technology for Siri, and the parent of one of the oldest and most respected voice recognition systems, Dragon NaturallySpeaking. One of the key themes, said Nuance Dragon Drive Solutions Marketing Manager Robert Policano, who was seated behind the driver in the Chrysler, is artificial intelligence and "how we use it on top of speech recognition and natural language processing to make it more contextual, offer more relevant results, and make it more humanized." When Ben-Gigi accepted, Dragon Drive pulled up gas stations with a focus on the brand he preferred. "While Apple is building an Apple experience, Nuance is helping BMW build the BMW experience," said Policano.


Google Now Employs AI to Help You Land a New Job

#artificialintelligence

As promised during its annual developer conference earlier this month, Google launched its search engine for jobs yesterday. Powered by artificial intelligence (AI), this specialized job search uses Google's Cloud Jobs API that launched back in 2016 as part of the company's "AI-first" approach. Instead of requiring users to download a new app, this new AI-powered tool allows job seekers in the U.S. to use Google's existing search function, which is accessible via desktop and mobile. However, Google won't handle any of the actual application process, and will simply direct you to the existing job application site.


'One machine learning model to rule them all': Google open-sources tools for simpler AI - ZDNet

#artificialintelligence

Google researchers have created what they call "one model to learn them all" for training AI models on different tasks using multiple types of training data. Models are also typically trained only on tasks from the same "domain", such as translation models being trained alongside other translation tasks. The model they created is trained on a variety of tasks, including image recognition, translation, image captioning, and speech recognition. The release also includes a library of datasets and models drawn from recent papers by Google Brain researchers.