Speed up model inference with Vertex AI Predictions' optimized TensorFlow runtime

Aug-6-2022, 19:45:51 GMT–#artificialintelligence

From product recommendations, to fraud detection, to route optimization, low latency predictions are vital for numerous machine learning tasks. That's why we're excited to announce a public preview for a new runtime that optimizes serving TensorFlow models on the Vertex AI Prediction service. This optimized TensorFlow runtime leverages technologies and model optimization techniques that are used internally at Google, and can be incorporated into your serving workflows without any changes to your training or model saving code. The result is faster predictions at a lower cost compared to the open source based pre-built TensorFlow serving containers. This post is a high-level overview of the optimized TensorFlow runtime that reviews some of its features, how to use it, and then provides benchmark data that demonstrates how it performs.

optimized tensorflow runtime, runtime, tensorflow runtime, (8 more...)

#artificialintelligence

Aug-6-2022, 19:45:51 GMT

News Web Page

Add feedback

Country:
- North America > United States (0.05)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found