Iterative Inference in a Chess-Playing Neural Network
Sandmann, Elias, Lapuschkin, Sebastian, Samek, Wojciech
–arXiv.org Artificial Intelligence
Do neural networks build their representations through smooth, gradual refinement, or via more complex computational processes? We investigate this by extending the logit lens to analyze the policy network of Leela Chess Zero, a superhuman chess engine. Although playing strength and puzzle-solving ability improve consistently across layers, capability progression occurs in distinct computational phases with move preferences undergoing continuous reevaluation--move rankings remain poorly correlated with final outputs until late, and correct puzzle solutions found in middle layers are sometimes overridden. This late-layer reversal is accompanied by concept preference analyses showing final layers prioritize safety over aggression, suggesting a mechanism by which heuristic priors can override tactical solutions.
arXiv.org Artificial Intelligence
Nov-26-2025
- Country:
- Europe > Netherlands (0.04)
- Genre:
- Research Report > New Finding (0.92)
- Industry:
- Leisure & Entertainment > Games > Chess (1.00)
- Technology: