Multi-model Machine Learning Inference Serving with GPU Spatial Partitioning