The Power of External Memory in Increasing Predictive Model Capacity