Best practices for implementing machine learning on Google Cloud
Use BigQuery to process tabular data. Use Dataflow to process unstructured data. Use managed datasets to link data to your models. The recommended approach for processing your data depends on the framework and data types you're using. This section provides high-level recommendations for common scenarios. For general recommendations on data engineering and feature engineering for ML, see Data preprocessing for machine learning: options and recommendations and Data preprocessing for machine learning using TensorFlow Transform. If you're using TensorFlow for model development, use TensorFlow Extended to prepare your data for training. TensorFlow Transform is the TensorFlow component that enables defining and executing a preprocessing function to transform your data.
May-21-2021, 18:05:22 GMT