Goto

Collaborating Authors

 tensorflow runtime


Speed up model inference with Vertex AI Predictions' optimized TensorFlow runtime

#artificialintelligence

From product recommendations, to fraud detection, to route optimization, low latency predictions are vital for numerous machine learning tasks. That's why we're excited to announce a public preview for a new runtime that optimizes serving TensorFlow models on the Vertex AI Prediction service. This optimized TensorFlow runtime leverages technologies and model optimization techniques that are used internally at Google, and can be incorporated into your serving workflows without any changes to your training or model saving code. The result is faster predictions at a lower cost compared to the open source based pre-built TensorFlow serving containers. This post is a high-level overview of the optimized TensorFlow runtime that reviews some of its features, how to use it, and then provides benchmark data that demonstrates how it performs.


Getting Started with TensorFlow: A Machine Learning Tutorial

#artificialintelligence

Over time, TensorFlow has grown in popularity and is now being used by developers for solving problems using deep learning methods for image recognition, video detection, text processing like sentiment analysis, etc. Like any other library, you may need some time to get used to the concepts that TensorFlow is built on. And, once you do, with the help of documentation and community support, representing problems as data graphs and solving them with TensorFlow can make machine learning at scale a less tedious process. In TensorFlow, constants are created using the constant function which takes a few parameters: Value, dtype (data type), shape, name and (verify_shape) shape verification.