Deploying Deep Neural Networks with NVIDIA TensorRT

Apr-15-2017, 05:32:26 GMT–#artificialintelligence

NVIDIA TensorRT is a high-performance deep learning inference library for production environments. Power efficiency and speed of response are two key metrics for deployed deep learning applications, because they directly affect the user experience and the cost of the service provided. Tensor RT automatically optimizes trained neural networks for run-time performance, delivering up to 16x higher energy efficiency (performance per watt) on a Tesla P100 GPU compared to common CPU-only deep learning inference systems (see Figure 1). Figure 2 shows the performance of NVIDIA Tesla P100 and K80 running inference using TensorRT with the relatively complex GoogLenet neural network architecture. In this post we will show you how you can use Tensor RT to get the best efficiency and performance out of your trained deep neural network on a GPU-based deployment platform.

artificial intelligence, machine learning, tensorrt, (14 more...)

#artificialintelligence

Apr-15-2017, 05:32:26 GMT

News Web Page

Add feedback

Country:
- North America > United States > California > Santa Clara County > San Jose (0.05)

Industry:
- Information Technology > Hardware (0.83)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found