Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network