Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning