Linear Recency Bias During Training Improves Transformers' Fit to Reading Times
