ChEmREF: Evaluating Language Model Readiness for Chemical Emergency Response