Lessons from the trenches on evaluating machine-learning systems in materials science