Deep reinforcement learning for quantum multiparameter estimation