Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
Neural Information Processing Systems
These failure cases are particularly troubling because they are not systematic; it is very difficult to predict when, for example, the order of information seemingly randomly causes a model to fail [Pezeshkpour and Hruschka, 2023, Liu et al., 2024, Li and Gao, 2024, Zheng et al.,
Feb-15-2026, 18:54:03 GMT
- Country:
- Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Health & Medicine (0.67)
- Technology: