Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents