What Makes an Evaluation Useful? Common Pitfalls and Best Practices