Quantifying the Impact of Non-Stationarity in Reinforcement Learning-Based Traffic Signal Control