Collaborating Authors

Speech Recognition

How to build your own AI-powered voice assistant


Ever wondered how Google assistant and Siri can speak with us exactly like humans. This is the magic of Deep Learning. So without wasting time let's jump directly to the topic. The above diagram will help you to get an overview of how the process happens inside the voice assistant. First I will explain each process in-depth and in the end, I will summarise the entire process with the help of an example.

McDonald's Drive-Thru Employees, Replaced By Artificial Intelligence


There's creative AI and then there's the hard-working AI – the artificial intelligence that is able to replace humans in routine work, saving up costs and allowing the employees to take charge of more complex tasks. That is the AI McDonald's is already testing in drive-thrus in the U.S. and looking to implement on a larger scale soon. A few years ago, McDonald's started to test the technology in the hope that it might be ready one day to take over at drive-thru locations. They had help from Apprente, the startup that gave them the building blocks of the technology, enabling them to build their own voice assistant. Now, the AI system is in place at 10 drive-thrus in Chicago.

Council Post: 15 Ways To Leverage AI In Customer Service


"The customer always comes first"--it's a business mantra as old as time, but it's more relevant now than ever before. These days, the businesses that know their customers well enough and cater to their needs and lifestyles accordingly, come out on top. With artificial intelligence (AI) advancing at phenomenal rates, there are so many ways for businesses to use it to learn more about their customers and provide the support they're looking for. From gathering data to speech recognition and message response times, AI can enhance the customer experience in nearly every way when it's applied correctly. Here, 15 members of Forbes Business Council share their expert insight on how organizations can leverage AI to enhance their customer service.

Ethics in voice technologies


The ethical use of voice technologies, such as speech and voice recognition, is becoming more important every day. Devices such as smart speakers, smartphones or smartwatches collect massive amounts of data from users thanks to the wide range of activities they allow (e.g., asking questions, setting reminders, checking bank accounts, accessing calendars, etc.). This data, as you might imagine, is often personal or private by nature. Companies offering services through these gadgets now have to assure not only a legal processing of user's data but also an ethical one. The above issue is not the only one that concerns ethics.

MLCommons Releases MLPerf Training v1.0 Results


Today, MLCommons, an open engineering consortium, released new results for MLPerf Training v1.0, the organization's machine learning training performance benchmark suite. MLPerf Training measures the time it takes to train machine learning models to a standard quality target in a variety of tasks including image classification, object detection, NLP, recommendation, and reinforcement learning. In its fourth round, MLCommons added two new benchmarks to evaluate the performance of speech-to-text and 3D medical imaging tasks. MLPerf Training is a full system benchmark, testing machine learning models, software, and hardware. With MLPerf, MLCommons now has a reliable and consistent way to track performance improvement over time, plus results from a "level playing field" benchmark drives competition, which in turn is driving performance.

Ambient Cloud Tech in Healthcare


The advent of Electronic Health Record systems and their accompanying documentation has created a deep fissure within the medical community. Epidemic-level numbers show that more and more physicians report feeling burnt out and depressed. The overall rate of work-life happiness reported by healthcare providers dropped below 50% thanks to the pandemic. Numbers released in Medscape's 2021 physician lifestyle report state that 43% of all physicians report feeling burnt out. Of those burnt-out physicians, 58% say they feel that way due to the long list of bureaucratic tasks like note taking and EHR documentation.

Artificial Intelligence, Part I - Basics.


Compared with humans, existing AI lacks several features (yes, i'm not sure about that word either Wikipedia) of human "commonsense reasoning" (as if commonsense was a thing lol); most notably, humans have powerful mechanisms for reasoning about "naive physics" such as space, time, and physical interactions (thankfully, we haven't worked on something like a humanoid robot which needed powerful mechanisms for reasoning about "naive physics"; we have just worked just on Digital Strategy (see what I'm doing here?), Chatbot Marketing (FYI you can try our chatbot very easily by sending us a message through Facebook Messenger or through our Website), and things like Incrediworld, Incredilosophy & Delphi - our Voice Recognition AI using Python, Numpy, SKLearn, GTTS, PYAudio, Speech Recognition & NLTK -.

What is Automatic Speech Recognition?


Speech recognition is concerned with understanding human communication and recognizing and translating it into texts by computers. It is also referred to as speech-to-text translation as it converts human speeches into a text-based format. ASR (Automated speech recognition) combined with IVR (interactive voice responses) can enable users to speak responses instead of typing them or pressing a button on their phones. The speaker-dependent systems are structured in such a way that they need to be trained, which are sometimes referred to as enrollment as well. It's working is pretty basic, the speaker needs to read the text or a series of isolated vocabulary into the system. The system will then process these recordings and associate them with text libraries.

McDonald's is replacing human drive-thru attendants with AI


This article was originally published on our sister site, Freethink. As if drive-through ordering wasn't frustrating enough already, now we might have a Siri-like AI to contend with. McDonald's just rolled out a voice recognition system at 10 drive-throughs in Chicago, expanding from the solitary test store they launched a few years ago. But when will it come to your neighborhood Golden Arches? "There is a big leap between going from 10 restaurants in Chicago to 14,000 restaurants across the U.S. with an infinite number of promo permutations, menu permutations, dialect permutations, weather -- I mean, on and on and on and on," admitted McDonald's CEO Chris Kempczinski, reports Nation's Restaurant News.

AI Guru Raj Reddy honoured by Silicon Valley's Computer History Museum.


The Computer History Museum (CHM) in Silicon Valley has honored Raj Reddy, an Indian-American professor and researcher, as part of its 2021 Fellow Awards program to contribute to artificial intelligence and continuous voice recognition. The other three recipients for 2021 were Raymond Ozzie, Lillian F. Schwartz, and Andries van Dam. Reddy, who grew up in the Andhra Pradesh district of Chittoor, has been teaching for five decades and is the founder of The Robotics Institute at Carnegie Mellon University in Pittsburg, Pennsylvania. The AI pioneer was also a driving force behind the Rajiv Gandhi University of Knowledge Technology establishment in Nuzvid, Andhra Pradesh. "New technologies have made it easier than ever to share knowledge and information and reach new audiences in this digital world," Reddy added.