Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators

May-31-2025, 09:51:58 GMT–Neural Information Processing Systems

In human-AI collaborative tasks, the distribution of human behavior, influenced by mental models, is non-stationary, manifesting in various levels of initiative and different collaborative strategies. A significant challenge in human-AI collaboration is determining how to collaborate effectively with humans exhibiting non-stationary dynamics. Current collaborative agents involve initially running self-play (SP) multiple times to build a policy pool, followed by training the final adaptive policy against this pool. These agents themselves are a single policy network, which is insufficient for handling non-stationary human dynamics. We discern that despite the inherent diversity in human behaviors, the underlying meta-tasks within specific collaborative contexts tend to be strikingly similar.

artificial intelligence, bayesian inference, machine learning, (18 more...)

Neural Information Processing Systems

May-31-2025, 09:51:58 GMT

Conferences PDF

Add feedback

Country:
- Asia (0.28)
- North America > United States
  - Massachusetts (0.14)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (1.00)

Industry:
- Leisure & Entertainment > Games (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning
    - Agents > Agent Societies (0.66)
    - Uncertainty > Bayesian Inference (0.46)

Duplicate Docs Excel Report

Title
Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators

Similar Docs Excel Report more

Title	Similarity	Source
None found