How to scale machine learning inference for multi-tenant SaaS use cases