Defining and Evaluating Physical Safety for Large Language Models