Can LLMs Solve and Generate Linguistic Olympiad Puzzles?
Majmudar, Neh, Filatova, Elena
–arXiv.org Artificial Intelligence
In this paper, we introduce a combination of novel and exciting tasks: the solution and generation of linguistic puzzles. We focus on puzzles used in Linguistic Olympiads for high school students. We first extend the existing benchmark for the task of solving linguistic puzzles. We explore the use of Large Language Models (LLMs), including recent state-of-the-art models such as OpenAI's o1, for solving linguistic puzzles, analyzing their performance across various linguistic topics. We demonstrate that LLMs outperform humans on most puzzles types, except for those centered on writing systems, and for the understudied languages. We use the insights from puzzle-solving experiments to direct the novel task of puzzle generation. We believe that automating puzzle generation, even for relatively simple puzzles, holds promise for expanding interest in linguistics and introducing the field to a broader audience. This finding highlights the importance of linguistic puzzle generation as a research task: such puzzles can not only promote linguistics but also support the dissemination of knowledge about rare and understudied languages.
arXiv.org Artificial Intelligence
Sep-29-2025
- Country:
- Africa > Middle East
- Europe
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Greece (0.05)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- United Kingdom (0.04)
- Bulgaria > Sofia City Province
- North America > United States
- Florida > Miami-Dade County > Miami (0.04)
- Genre:
- Overview (0.93)
- Research Report > Promising Solution (0.34)
- Industry:
- Education > Educational Setting > K-12 Education > Secondary School (0.54)
- Technology: