Provably Adaptive Average Reward Reinforcement Learning for Metric Spaces