Reward Model Interpretability via Optimal and Pessimal Tokens