Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining