An Arbitration Control for an Ensemble of Diversified DQN Variants in Continual Reinforcement Learning