Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable events