Federated Temporal Difference Learning with Linear Function Approximation under Environmental Heterogeneity

Open in new window