Building a Speech-Enabled AI Virtual Assistant with NVIDIA Riva on Amazon EC2
Speech AI can assist human agents in contact centers, power virtual assistants and digital avatars, generate live captioning in video conferencing, and much more. Under the hood, these voice-based technologies orchestrate a network of automatic speech recognition (ASR) and text-to-speech (TTS) pipelines to deliver intelligent, real-time responses. Building these real-time speech AI applications from scratch is no easy task. From setting up GPU-optimized development environments to deploying speech AI inferences using customized large transformer-based language models in under 300ms, speech AI pipelines require dedicated time, expertise, and investment. In this post, we walk through how you can simplify the speech AI development process by using NVIDIA Riva to run GPU-optimized applications.
Aug-9-2022, 09:00:49 GMT