E-HBA: Using Action Policies for Expert Advice and Agent Typification

Albrecht, Stefano V., Crandall, Jacob W., Ramamoorthy, Subramanian

Jul-23-2019–arXiv.org Artificial Intelligence

Past research has studied two approaches to utilise predefined policy sets in repeated interactions: as experts, to dictate our own actions, and as types, to characterise the behaviour of other agents. In this work, we bring these complementary views together in the form of a novel meta-algorithm, called Expert-HBA (E-HBA), which can be applied to any expert algorithm that considers the average (or total) payoff an expert has yielded in the past. E-HBA gradually mixes the past payoff with a predicted future payoff, which is computed using the type-based characterisation. We present results from a comprehensive set of repeated matrix games, comparing the performance of several well-known expert algorithms with and without the aid of E-HBA. Our results show that E-HBA has the potential to significantly improve the performance of expert algorithms.
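The mixing step described above can be illustrated with a minimal sketch. The abstract gives neither the exact mixing schedule nor the form of the type-based prediction, so everything below is an assumption made for illustration: the function names, the one-shot Bayesian update over types, the use of an expected payoff per type, and the convex-combination weight `weight_on_past` are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def posterior_over_types(prior: Sequence[float],
                         likelihoods: Sequence[float]) -> List[float]:
    """Bayes' rule over hypothesised types (assumed form of the
    type-based characterisation): posterior[k] is proportional to
    prior[k] * likelihood of the observed behaviour under type k."""
    unnormalised = [p * l for p, l in zip(prior, likelihoods)]
    total = sum(unnormalised)
    if total == 0.0:
        # No type explains the observations; fall back to the prior.
        return list(prior)
    return [u / total for u in unnormalised]


def predicted_payoff(posterior: Sequence[float],
                     payoff_vs_type: Sequence[float]) -> float:
    """Expected payoff of one expert, averaging its (assumed known)
    payoff against each type under the current posterior."""
    return sum(p * u for p, u in zip(posterior, payoff_vs_type))


def mixed_score(past_avg: float, predicted: float,
                weight_on_past: float) -> float:
    """Convex combination of past average payoff and predicted payoff.
    How the weight changes over time is an assumption left to the caller."""
    if not 0.0 <= weight_on_past <= 1.0:
        raise ValueError("weight_on_past must lie in [0, 1]")
    return weight_on_past * past_avg + (1.0 - weight_on_past) * predicted


# Hypothetical numbers: two types, one expert.
posterior = posterior_over_types(prior=[0.5, 0.5], likelihoods=[0.9, 0.1])
future = predicted_payoff(posterior, payoff_vs_type=[1.0, 3.0])
score = mixed_score(past_avg=2.0, predicted=future, weight_on_past=0.25)
```

Under these assumptions, an expert algorithm that ranks experts by average past payoff would rank them by `mixed_score` instead, with the weight shifted gradually between the two payoff terms as the interaction proceeds.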

artificial intelligence, expert algorithm, machine learning
