The Lucie-7B LLM and the Lucie Training Dataset: Open resources for multilingual language generation