Which Word Orders Facilitate Length Generalization in LMs? An Investigation with GCG-Based Artificial Languages

Open in new window