Demystifying the Accuracy-Interpretability Trade-Off: A Case Study of Inferring Ratings from Reviews