Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators

Neural Information Processing Systems 

We provide theoretical guarantees for CBPR's rapid convergence to the optimal policy once human partners alter their policies.
