Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents

Open in new window