Are LLMs Good Cryptic Crossword Solvers?
Sadallah, Abdelrahman "Boda"; Kotova, Daria; Kochmar, Ekaterina
arXiv.org Artificial Intelligence
Cryptic crosswords are puzzles that rely not only on general knowledge but also on the solver's ability to manipulate language at different levels and to deal with various types of wordplay. Previous research suggests that solving such puzzles is a challenge even for modern NLP models. However, the abilities of large language models (LLMs) have not yet been tested on this task. In this paper, we establish benchmark results for three popular LLMs (LLaMA2, Mistral, and ChatGPT), showing that their performance on this task is still far from that of humans.
Mar-15-2024
- Country:
  - Europe > Germany
    - Saarland > Saarbrücken (0.04)
  - North America > United States
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - New York > New York County
      - New York City (0.04)
  - Oceania > New Zealand
    - North Island > Auckland Region > Auckland (0.04)
- Genre:
  - Research Report > New Finding (1.00)
- Technology: