New NVIDIA Pascal GPUs Accelerate Deep Learning Inference
BEIJING, CHINA--(Marketwired - Sep 12, 2016) - GPU Technology Conference China - NVIDIA (NASDAQ: NVDA) today unveiled the latest additions to its Pascal architecture-based deep learning platform, with new NVIDIA Tesla P4 and P40 GPU accelerators and new software that deliver massive leaps in efficiency and speed to accelerate inferencing production workloads for artificial intelligence services. Modern AI services such as voice-activated assistance, email spam filters, and movie and product recommendation engines are rapidly growing in complexity, requiring up to 10x more compute compared to neural networks from a year ago. Current CPU-based technology isn't capable of delivering real-time responsiveness required for modern AI services, leading to a poor user experience. The Tesla P4 and P40 are specifically designed for inferencing, which uses trained deep neural networks to recognize speech, images or text in response to queries from users and devices. Based on the Pascal architecture, these GPUs feature specialized inference instructions based on 8-bit (INT8) operations, delivering 45x faster response than CPUs1 and a 4x improvement over GPU solutions launched less than a year ago.2
Jun-19-2017, 02:25:34 GMT
- Country:
- North America > United States (0.30)
- Asia > China
- Genre:
- Press Release (0.68)
- Industry:
- Information Technology > Hardware (0.98)
- Banking & Finance > Trading (0.72)
- Technology: