Systematic Generalization in Language Models Scales with Information Entropy
