PRobELM: Plausibility Ranking Evaluation for Language Models