Reviews: Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems