Deep reinforcement learning for optimal trading with partial information