Collaborating Authors

How to use Watson Speech to Text utilities to increase accuracy - Watson


June 23, 2017. Written by: Simon O'Doherty. Key Points: – Learn how to use Watson Speech to Text utilities to increase your accuracy – We've included links so you can download S2T utilities – Sample .wav. I thought I would take a moment to play with Watson Speech to Text and a utility that was released a few months ago. The Speech to Text Utils allow you to train S2T using your existing conversational system. To give a quick demo, I got my son to ask about buying a puppy. Of course, the recording is crystal clear, which is why the result is so good.

Michael Keaton deserves an award for the Batman comment he made at a graduation speech


Michael Keaton was a damn good Batman. The legendary actor gave a commencement speech at Kent State University's graduation recently, and he wrapped things up in the most unbelievably wonderful way. Michael Keaton closed his commencement speech at Kent State with "I'm Batman." And this is why Michael Keaton is the best.

Speech Research Lab: Speech Synthesis, Speech Recognition, and Speech Processing


The ModelTalker TTS system converts plain English text to speech. It uses a text-to-phoneme system that includes capabilities for parsing ToBI-like descriptions of intonation. Synthesis is accomplished through a combination of database-driven speech and a variant of diphone-based phoneme-to-sound engines known as Biphone-Constrained Concatenation (BCC). Speech stored in the database encompasses common words and phrases in different contexts as well as a complete set of biphones. The BCC sound engine results in smoother, more natural speech without sacrificing the ability to quickly "capture" new voices in the biphone inventories for the system.
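
As a rough illustration of the idea described above (recorded words and phrases where the database has them, biphone units otherwise), the toy sketch below plans which units a synthesizer might concatenate. It is not ModelTalker's actual BCC algorithm; the function names, the `#` boundary symbol, and the tiny lexicon are all assumptions made for this example.

```python
from typing import Dict, List, Set, Tuple

Unit = Tuple[str, str]  # (kind, label), e.g. ("biphone", "k-ae")


def to_biphones(phonemes: List[str], boundary: str = "#") -> List[str]:
    """Split a phoneme sequence into overlapping adjacent pairs.

    The boundary symbol (hypothetical) marks silence at both ends.
    """
    padded = [boundary, *phonemes, boundary]
    return [f"{left}-{right}" for left, right in zip(padded, padded[1:])]


def plan_units(words: List[str], recorded: Set[str],
               lexicon: Dict[str, List[str]]) -> List[Unit]:
    """Prefer whole recorded words; fall back to biphone units."""
    units: List[Unit] = []
    for word in words:
        if word in recorded:
            units.append(("recorded", word))
        else:
            units.extend(("biphone", bp) for bp in to_biphones(lexicon[word]))
    return units


# Hypothetical data: one recorded word and one word built from biphones.
recorded_words = {"hello"}
toy_lexicon = {"cat": ["k", "ae", "t"]}
print(plan_units(["hello", "cat"], recorded_words, toy_lexicon))
```

A real system would also have to select among multiple recordings of the same unit and smooth the joins; this sketch only shows the fallback from stored speech to biphones.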

Mozilla Deep Speech: An open source Speech-to-Text Engine


An open source speech recognition engine based on TensorFlow. Deep Speech is an open source engine that anyone can easily use as a Speech-To-Text (STT) engine; it is also used to demonstrate trained machine learning models. Project Deep Speech uses Google's TensorFlow to achieve better performance with fewer challenges. It aims to make speech recognition technology and trained models openly available to developers, and it is also a deep learning-based Automatic Speech Recognition (ASR) engine with a simple API. The project also provides pre-trained English models.
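
To give a feel for the simple API mentioned above, here is a minimal sketch of transcribing a WAV file. It assumes the `deepspeech` Python package (version 0.7 or later) and `numpy` are installed and that a pre-trained model file has been downloaded separately; the file names in the usage comment are placeholders, and `read_pcm16` and `transcribe` are helper names invented for this example.

```python
import wave


def read_pcm16(path):
    """Return (sample_rate, raw_bytes) for a mono 16-bit PCM WAV file."""
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected mono 16-bit PCM audio")
        return wav.getframerate(), wav.readframes(wav.getnframes())


def transcribe(model_path, wav_path):
    """Run speech-to-text on one WAV file with a pre-trained model."""
    # Imported lazily so read_pcm16 works without these packages installed.
    import numpy as np
    from deepspeech import Model

    model = Model(model_path)
    rate, pcm = read_pcm16(wav_path)
    if rate != model.sampleRate():
        raise ValueError(f"model expects {model.sampleRate()} Hz audio, got {rate} Hz")
    # stt() takes the audio as a 1-D array of 16-bit integer samples.
    return model.stt(np.frombuffer(pcm, dtype=np.int16))


# Usage (placeholder file names):
# print(transcribe("model.pbmm", "sample.wav"))
```

The sample-rate check is there because the audio is expected to match the rate the model was trained on; resampling is left out of this sketch.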

Jay-Z's mother gives a touching speech at GLAAD awards


On Jay-Z's latest album, 4:44, the rapper discussed his mother's sexuality for the first time, describing how she came out to him and the family late in life. The track is called "Smile," and it features a moving poem from Ms. Gloria Carter on living a life in the shadows for too long. During the 29th GLAAD Media Awards this weekend, Carter stepped out of those very shadows to accept the Special Recognition Award for her role in the creation of "Smile," giving a touching acceptance speech that included her first public remarks on her son's song. "I'm old school," Carter said, starting off her acceptance speech. "I wrote a little something and just want to share it with you guys." "Thanks to my family, for loving me unconditionally no matter what," she continued.