Large language models transition from integrating across position-yoked, exponential windows to structure-yoked, power-law windows

Neural Information Processing Systems 

Prior work suggests that human brain responses to language exhibit hierarchically organized "integration windows" that substantially constrain the