Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia

Open in new window