The Evolution of Reinforcement Learning in Quantitative Finance