Socially Responsible Data for Large Multilingual Language Models