Causal language modeling can elicit search and reasoning capabilities on logic puzzles

May-27-2025, 03:53:25 GMT–Neural Information Processing Systems

Causal language modeling using the Transformer architecture has yielded remarkable capabilities in Large Language Models (LLMs) over the last few years. However, the extent to which fundamental search and reasoning capabilities have emerged within LLMs remains a topic of ongoing debate. In this work, we study whether causal language modeling can learn a complex task such as solving Sudoku puzzles. To solve a Sudoku, the model must first search over all empty cells of the puzzle to decide which cell to fill, and then apply an appropriate strategy to fill the chosen cell. Sometimes, applying a strategy only narrows down the possible values of a cell rather than determining its exact value.
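
To make the search-then-fill loop concrete, the following is a minimal Python sketch of a classical solver step. It is not the model or method studied in the paper. The function names (`candidates`, `next_step`, `solve_by_elimination`) and the grid encoding (a 9x9 list of lists, with 0 marking an empty cell) are assumptions made for illustration. The single strategy shown is plain candidate elimination: a cell is filled only when exactly one value remains possible, and otherwise its candidate set is merely narrowed.

```python
from typing import List, Optional, Set, Tuple

Grid = List[List[int]]  # assumed encoding: 9x9 grid, 0 marks an empty cell


def candidates(grid: Grid, r: int, c: int) -> Set[int]:
    """Values not already used in the row, column, or 3x3 box of cell (r, c)."""
    used = set(grid[r]) | {grid[i][c] for i in range(9)}
    br, bc = 3 * (r // 3), 3 * (c // 3)
    used |= {grid[i][j] for i in range(br, br + 3) for j in range(bc, bc + 3)}
    return set(range(1, 10)) - used


def next_step(grid: Grid) -> Optional[Tuple[int, int, int]]:
    """Search every empty cell and return (row, col, value) for the first
    cell whose candidate set has exactly one value, or None if there is none."""
    for r in range(9):
        for c in range(9):
            if grid[r][c] == 0:
                cand = candidates(grid, r, c)
                if len(cand) == 1:
                    return r, c, next(iter(cand))
    return None


def solve_by_elimination(grid: Grid) -> Grid:
    """Repeat search-then-fill until no cell can be decided.
    Harder puzzles may stay partially unsolved under this one strategy."""
    grid = [row[:] for row in grid]  # work on a copy
    while (step := next_step(grid)) is not None:
        r, c, v = step
        grid[r][c] = v
    return grid
```

Calling `candidates(grid, r, c)` on a cell with several remaining values corresponds to the case where a strategy only narrows the options; `next_step` corresponds to deciding which cell to fill next.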

artificial intelligence, large language model, natural language

Industry:
- Leisure & Entertainment > Games > Sudoku (0.54)

Technology:
- Information Technology > Artificial Intelligence > Natural Language
  - Large Language Model (1.00)
  - Chatbot (0.87)