Reinforcement Learning from Statistical Feedback: the Journey from AB Testing to ANT Testing