MA4DIV: Multi-Agent Reinforcement Learning for Search Result Diversification