A Practical Guide for Evaluating LLMs and LLM-Reliant Systems