The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

Open in new window