Between Circuits and Chomsky: Pre-pretraining on Formal Languages Imparts Linguistic Biases
