Pushing the Performance Envelope of DNN-based Recommendation Systems Inference on GPUs

Open in new window