PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers

Open in new window