Data Processing for the OpenGPT-X Model Family