Multi-agent reinforcement learning strategy to maximize the lifetime of Wireless Rechargeable