Signatures of human-like processing in Transformer forward passes
Hu, Jennifer, Lepori, Michael A., Franke, Michael
–arXiv.org Artificial Intelligence
Modern AI models are increasingly being used as theoretical tools to study human cognition. One dominant approach is to evaluate whether human-derived measures are predicted by a model's output: that is, the end-product of a forward pass. However, recent advances in mechanistic interpretability have begun to reveal the internal processes that give rise to model outputs, raising the question of whether models might use human-like processing strategies. Here, we investigate the relationship between real-time processing in humans and layer-time dynamics of computation in Transformers, testing 20 open-source models in 6 domains. We first explore whether forward passes show mechanistic signatures of competitor interference, taking high-level inspiration from cognitive theories. We find that models indeed appear to initially favor a competing incorrect answer in the cases where we would expect decision conflict in humans. We then systematically test whether forward-pass dynamics predict signatures of processing in humans, above and beyond properties of the model's output probability distribution. We find that dynamic measures improve prediction of human processing measures relative to static final-layer measures. Moreover, across our experiments, larger models do not always show more human-like processing patterns. Our work suggests a new way of using AI models to study human cognition: not just as a black box mapping stimuli to responses, but potentially also as explicit processing models.
arXiv.org Artificial Intelligence
May-20-2025
- Country:
- Africa > Middle East
- Morocco
- Casablanca-Settat Region > Casablanca (0.04)
- Marrakesh-Safi Region > Marrakesh (0.04)
- Morocco
- Asia
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Middle East
- Europe
- Austria > Vienna (0.14)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Germany > Baden-Württemberg
- Tübingen Region > Tübingen (0.14)
- Italy > Tuscany
- Florence (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada > Ontario
- Toronto (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- Illinois
- Cook County > Chicago (0.04)
- Sangamon County > Springfield (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Anne Arundel County
- Annapolis (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Pennsylvania (0.04)
- Illinois
- Canada > Ontario
- Africa > Middle East
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Government > Regional Government (0.46)
- Health & Medicine (0.93)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science (1.00)
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Natural Language
- Chatbot (0.95)
- Large Language Model (1.00)
- Representation & Reasoning (0.67)
- Vision (1.00)
- Information Technology > Artificial Intelligence