Contextual Bandits for Evaluating and Improving Inventory Control Policies