Data Management For Large Language Models: A Survey