CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference