How Machine Learning Pipelines Work and What Needs Improving - The New Stack
This article is a post in a series on bringing continuous integration and deployment (CI/CD) practices to machine learning. Check back to The New Stack for future installments. The pipeline runs from ingesting and cleaning data, through feature engineering and model selection in an interactive workbench environment, to training and experiments, usually with the option to share results, to deploying the trained model, to serving results like predictions and classifications. The machine learning development and deployment pipelines are often separate, but unless the model is static, it will need to be retrained on new data or updated as the world changes, and updated and versioned in production, which means going through several steps of the pipeline again and again. Managing the complexity of these pipelines is getting harder, especially when you're trying to use real-time data and update models frequently.
Mar-6-2019, 21:19:15 GMT