TL;DR Baidu's TTS system now supports multi-speaker conditioning, and can learn new speakers with very little data (a la LyreBird). I'm really excited about the recent influx of neural-net TTS systems, but all of the them seem to be too slow for real time dialog, or not publicly available, or both. Hoping that one of them gets a high quality open-source implementation soon!
Emotion Detection and Recognition market to reach USD 22.65 billion by 2020 This study has been done on a global level broadly covering four regions, namely, North America, Europe, APAC, Middle East and RoW, and the market is projected to grow from USD 5.66 billion in 2015 to USD 22.65 billion by 2020, at a CAGR of 31.9% during the period. The market is being driven by factors such as increased focus on affective computing, business intelligence, and growing amount of spatial data as well as prompt availability of analytical tools. "Law Enforcement, Surveillance, and Monitoring areas are projected to showcase robust growth in the emotion detection and recognition market" The defense and security agencies require emotion detection technology for surveillance and monitoring purposes. Major implementation of this technology has already been done in the areas of military services such as lie detectors and polygraph tests. The emotion detection technology helps in matching the records in real-time and detecting the stress levels of a criminal.
Panasonic and Nuance have been close partners on TV voice recognition in the past; we now know that they're getting a bit cozier for Panasonic's 2013 Smart TVs. The engine will also speak out content and menus if you need more than just visual confirmation of where you're going. Panasonic's refreshed TV line is gradually rolling out over the spring, so those who see a plastic remote control as so very 2010 won't have long to wait. Panasonic's New Smart TVs Now Listen and Speak with Nuance's Dragon TV Panasonic's New SMART VIERA HDTVs Voice Interaction Lets People Find TV Content, Search the Web, Get Access to Apps and More with the Power of Dragon Now people can simply sit back and speak to find content, search the web, control volume and more – creating a more interactive and intelligent television experience. And with Dragon TV's text-to-speech, television content and options on the screen can be read aloud.
Mark Zuckerberg has ended 2016 having completed his personal challenge to build a Jarvis-style AI to run his home. He announced at the start of the year that he wanted to build a simple AI that could control his home, including his lights, temperature, appliances, music and security. He also wanted it to "learn his tastes and patterns, learn new words and concepts, and even entertain Max" (his daughter.) And now he has published a blog post explaining how he did it. Zuckerberg's Jarvis uses several artificial intelligence techniques, including natural language processing, speech recognition, face recognition, and reinforcement learning, written in Python, PHP and Objective C. "Before I could build any AI, I first needed to write code to connect these systems, which all speak different languages and protocols," the Facebook founder explained.