Word class representations spontaneously emerge in a deep neural network trained on next word prediction
