MQLV: Modified Q-Learning for Vasicek Model