Deep Reinforcement Learning for Long Term Hydropower Production Scheduling