Data-driven battery operation for energy arbitrage using rainbow deep reinforcement learning