Building a Speech-Enabled AI Virtual Assistant with NVIDIA Riva on Amazon EC2

#artificialintelligence

Speech AI can assist human agents in contact centers, power virtual assistants and digital avatars, generate live captioning in video conferencing, and much more. Under the hood, these voice-based technologies orchestrate a network of automatic speech recognition (ASR) and text-to-speech (TTS) pipelines to deliver intelligent, real-time responses. Building these real-time speech AI applications from scratch is no easy task. From setting up GPU-optimized development environments to deploying speech AI inferences using customized large transformer-based language models in under 300ms, speech AI pipelines require dedicated time, expertise, and investment. In this post, we walk through how you can simplify the speech AI development process by using NVIDIA Riva to run GPU-optimized applications.
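As a rough illustration of the client side of such a pipeline, the sketch below transcribes a short recording against a running Riva server. It assumes the nvidia-riva-client Python package (riva.client), a server listening on localhost:50051, and a 16 kHz mono recording named audio.wav; none of these details come from the post itself.

```python
import riva.client

# Connect to a Riva server; the address is an assumption (Riva's default gRPC port).
auth = riva.client.Auth(uri="localhost:50051")
asr_service = riva.client.ASRService(auth)

config = riva.client.RecognitionConfig(
    encoding=riva.client.AudioEncoding.LINEAR_PCM,
    sample_rate_hertz=16000,
    language_code="en-US",
    max_alternatives=1,
    enable_automatic_punctuation=True,
)

# "audio.wav" is a placeholder for a 16 kHz mono recording.
with open("audio.wav", "rb") as f:
    audio_bytes = f.read()

# Offline (batch) recognition; streaming recognition follows a similar pattern.
response = asr_service.offline_recognize(audio_bytes, config)
for result in response.results:
    print(result.alternatives[0].transcript)
```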


Pytorch: Creating a Custom Dataset Class With Specific Data Transformations for Semantic…

#artificialintelligence

Whenever you work on a deep learning project with PyTorch, you might have to define your own dataset class at some point. In my case, I am working on a project that uses semantic segmentation to train a transformer model that can generalize geometric shapes (such as building footprints) at different scales. While working on my implementation, I found it hard to find specific examples covering semantic segmentation, which is why I decided to share parts of my experience. In the following, I will show you how I set up my dataset class and apply the desired data transformations to the "input" and the "mask" data. Let's go through this code step by step.
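As a rough sketch of that idea (the directory layout, image size, and flip augmentation below are placeholder choices, not the article's code), a dataset class that applies the same transformations to the input and the mask can look like this:

```python
import os
import torch
from PIL import Image
from torch.utils.data import Dataset
import torchvision.transforms.functional as TF
from torchvision.transforms import InterpolationMode

class SegmentationDataset(Dataset):
    """Loads (image, mask) pairs and applies identical transforms to both."""

    def __init__(self, image_dir, mask_dir, size=(256, 256)):
        self.image_dir = image_dir
        self.mask_dir = mask_dir
        self.size = size
        self.filenames = sorted(os.listdir(image_dir))  # assumes matching file names

    def __len__(self):
        return len(self.filenames)

    def __getitem__(self, idx):
        name = self.filenames[idx]
        image = Image.open(os.path.join(self.image_dir, name)).convert("RGB")
        mask = Image.open(os.path.join(self.mask_dir, name)).convert("L")

        # Resize both; nearest-neighbour interpolation keeps mask labels intact.
        image = TF.resize(image, self.size)
        mask = TF.resize(mask, self.size, interpolation=InterpolationMode.NEAREST)

        # Apply the same random horizontal flip to the input and the mask.
        if torch.rand(1).item() < 0.5:
            image, mask = TF.hflip(image), TF.hflip(mask)

        image = TF.to_tensor(image)                      # float tensor in [0, 1]
        mask = TF.pil_to_tensor(mask).squeeze(0).long()  # class index per pixel
        return image, mask
```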


Apache Spark in Python: Beginner's Guide

#artificialintelligence

In this article, we are going to explain Apache Spark and Python in more detail. You may also want to glance at this PySpark training course, which will teach you the skills you'll need to become a professional Python Spark developer. Let's begin by understanding Apache Spark. Apache Spark is an open-source framework that has been making headlines since its beginnings in 2009 at UC Berkeley's AMPLab; at its core, it is an engine for distributed processing of big data that can scale at will. Simply put, as the volume of data increases, it becomes increasingly important to be able to handle enormous streams of data while still performing other operations such as machine learning, and Apache Spark can do just that. According to several experts, it will soon become the standard platform for streaming computation.
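As a minimal taste of PySpark (the file name and column names below are hypothetical), creating a session, reading a file, and running a simple aggregation looks like this:

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

# Start a local Spark session.
spark = (
    SparkSession.builder
    .appName("pyspark-quickstart")
    .master("local[*]")
    .getOrCreate()
)

# "sales.csv" with "region" and "revenue" columns is a placeholder dataset.
df = spark.read.csv("sales.csv", header=True, inferSchema=True)

# A simple distributed aggregation: total revenue per region.
summary = df.groupBy("region").agg(F.sum("revenue").alias("total_revenue"))
summary.show()

spark.stop()
```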


Detect and Blur human faces on your website

#artificialintelligence

Content moderation is essential for a website. If you are developing a website where users can upload images, then you have to be extra cautious. If they upload objectionable content, you, as the creator of the site, can end up suffering the consequences. Every modern web 2.0 application has a content moderation system in place. Popular websites like Facebook, Instagram, and Twitter have both automatic and manual content moderation systems.
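A common building block for such a system is automatic face blurring. The sketch below (not taken from the article) uses OpenCV's bundled Haar cascade to detect faces in an uploaded image and blur them; the file names are placeholders.

```python
import cv2

# Load OpenCV's bundled Haar cascade for frontal face detection.
cascade_path = cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
face_cascade = cv2.CascadeClassifier(cascade_path)

image = cv2.imread("upload.jpg")  # hypothetical uploaded image
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

# Detect faces and blur each detected region.
faces = face_cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
for (x, y, w, h) in faces:
    roi = image[y:y + h, x:x + w]
    # The Gaussian kernel size must be odd; a large kernel hides identity well.
    image[y:y + h, x:x + w] = cv2.GaussianBlur(roi, (51, 51), 0)

cv2.imwrite("upload_blurred.jpg", image)
```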


GitHub - google-research/t5x

#artificialintelligence

T5X is a modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models (starting with language) at many scales. It is essentially a new and improved implementation of the T5 codebase (based on Mesh TensorFlow) in JAX and Flax. Below is a quick start guide for training models with TPUs on Google Cloud. For additional tutorials and background, see the complete documentation. Vertex AI is a platform for training that creates TPU instances and runs code on the TPUs.


Building speech controlled robot with Tensil and Arty A7 - Part I

#artificialintelligence

In this two-part tutorial we will learn how to build a speech-controlled robot using the Tensil open-source machine learning (ML) acceleration framework and the Digilent Arty A7-100T FPGA board. At the heart of this robot is an ML model for speech recognition. We will learn how the Tensil framework enables ML inference to be tightly integrated with digital signal processing in the resource-constrained environment of a mid-range Xilinx Artix-7 FPGA. Part I will focus on recognizing speech commands through a microphone. Part II will focus on translating commands into robot behavior and integrating with the mechanical platform. Let's start by specifying what commands we want the robot to understand. To keep the mechanical platform simple (and inexpensive) we will build on a wheeled chassis with two motors. The robot will recognize directives to move forward in a straight line (go!), turn in place clockwise (right!) and counterclockwise (left!), and turn the motors off (stop!). Now that we know what robot we want to build, let's define its high-level system architecture. This architecture will revolve around the Arty board, which provides the "brains" of our robot. In order for the robot to "hear", we need a microphone. The Arty board offers native connectivity with the PMOD ecosystem, and Digilent's MIC3 PMOD combines a microphone with an ADCS7476 analog-to-digital converter. And in order to control the motors, we need two HB3 PMOD drivers, also from Digilent, which convert digital signals into the voltage level and polarity needed to drive the motors.
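Purely as an illustration of the command set (this mapping is hypothetical and not part of the Tensil tutorial), the four directives can be thought of as pairs of motor directions for the two-wheeled chassis:

```python
# Hypothetical illustration: mapping the four recognized commands to the
# direction of each motor. +1 = forward, -1 = reverse, 0 = off.
COMMANDS = {
    "go":    (+1, +1),  # both wheels forward -> drive straight
    "right": (+1, -1),  # wheels in opposite directions -> turn clockwise in place
    "left":  (-1, +1),  # opposite directions -> turn counterclockwise in place
    "stop":  (0, 0),    # both motors off
}

def motor_directives(command: str) -> tuple[int, int]:
    """Return (left_motor, right_motor) directives for a recognized command."""
    return COMMANDS.get(command, (0, 0))  # unknown commands fall back to stop

print(motor_directives("right"))  # (1, -1)
```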


Hosting Models with TF Serving on Docker

#artificialintelligence

Training a Machine Learning (ML) model is only one step in the ML lifecycle. There's no purpose to ML if you cannot get a response from your model. You must be able to host your trained model for inference. There is a variety of hosting/deployment options that can be used for ML, with one of the most popular being TensorFlow Serving. TensorFlow Serving takes your trained model's artifacts and hosts them for inference.
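As a minimal client-side sketch (the toy model, paths, model name, and port below are assumptions, using TF Serving's documented defaults), exporting a SavedModel and querying the REST predict endpoint looks like this:

```python
import json
import numpy as np
import requests
import tensorflow as tf

# Build and export a toy model in the SavedModel format that TF Serving expects.
# (Assumes TensorFlow 2.x; the model and paths are placeholders.)
model = tf.keras.Sequential([tf.keras.layers.Dense(1, input_shape=(4,))])
tf.saved_model.save(model, "models/my_model/1")  # numeric version subdirectory

# Assuming a TF Serving container is already running, e.g.
#   docker run -p 8501:8501 \
#     -v "$PWD/models/my_model:/models/my_model" \
#     -e MODEL_NAME=my_model tensorflow/serving
# the REST predict endpoint accepts JSON with an "instances" field:
url = "http://localhost:8501/v1/models/my_model:predict"
payload = {"instances": np.random.rand(2, 4).tolist()}

response = requests.post(url, data=json.dumps(payload))
print(response.json())  # e.g. {"predictions": [[...], [...]]}
```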


100%OFF

#artificialintelligence

The following sentence is annoying, but also true: the best time to learn Git was yesterday. Fortunately, the second best time is today! Git is an essential tool for work in any code-related field, from data science to game development to machine learning. This course covers everything you need to know to start using Git and GitHub in the real world today! The course's 20 sections are broken down into four separate units. We start off with Git Essentials. The goal of this unit is to give you all the essential Git tools you need for daily use.


Data Collection Using OpenCV

#artificialintelligence

Originally published on Towards AI, the World's Leading AI and Technology News and Media Company. OpenCV, short for "Open Source Computer Vision", is a machine learning library that was designed to enable image processing and computer vision applications.
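Collecting image data with OpenCV typically means grabbing frames from a camera and writing them to disk. Below is a minimal sketch (not from the article) that captures frames from the default webcam and saves the ones you mark; the folder name and key bindings are arbitrary choices.

```python
import os
import cv2

save_dir = "collected_images"  # hypothetical output folder
os.makedirs(save_dir, exist_ok=True)

cap = cv2.VideoCapture(0)      # open the default webcam
count = 0

while count < 100:             # collect up to 100 images
    ok, frame = cap.read()
    if not ok:
        break
    cv2.imshow("capture", frame)
    # Press 's' to save the current frame, 'q' to quit early.
    key = cv2.waitKey(1) & 0xFF
    if key == ord("s"):
        cv2.imwrite(os.path.join(save_dir, f"img_{count:04d}.jpg"), frame)
        count += 1
    elif key == ord("q"):
        break

cap.release()
cv2.destroyAllWindows()
```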


Disease Classification using Medical MNIST

#artificialintelligence

The objective of this study is to classify medical images using a Convolutional Neural Network (CNN) model. Here, I trained a CNN model with a well-processed dataset of medical images. This model can be used to classify medical images into the categories provided in the training dataset. This dataset was developed in 2017 by Arturo Polanco Lozano. It is also known as the MedNIST dataset for radiology and medical imaging. For the preparation of this dataset, images were gathered from several sources, namely TCIA, the RSNA Bone Age Challenge, and the NIH Chest X-ray dataset.
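For illustration, here is a minimal PyTorch sketch of the kind of CNN that can classify the 64x64 grayscale MedNIST images; the layer sizes and the assumption of six output classes are mine, not taken from the article.

```python
import torch
import torch.nn as nn

class MedNISTCNN(nn.Module):
    """Small CNN sketch for 64x64 grayscale medical images with 6 classes."""

    def __init__(self, num_classes: int = 6):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),   # 64 -> 32
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),  # 32 -> 16
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),  # 16 -> 8
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(64 * 8 * 8, 128), nn.ReLU(),
            nn.Linear(128, num_classes),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.classifier(self.features(x))

model = MedNISTCNN()
logits = model(torch.randn(4, 1, 64, 64))  # dummy batch of 4 single-channel images
print(logits.shape)                        # torch.Size([4, 6])
```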