Pushing the Performance Envelope of DNN-based Recommendation Systems Inference on GPUs