Conservative and Reward-driven Behavior Selection in a Commonsense Reasoning Framework

Johnston, Benjamin (University of Technology, Sydney) | Williams, Mary-Anne (University of Technology, Sydney)

AAAI Conferences 

Comirit is a framework for commonsense reasoning that combines simulation, logical deduction and passive machine learning. While a passive, observation-driven approach to learning is safe and highly conservative, it is limited to inte-raction only with those objects that it has previously ob-served. In this paper we describe a preliminary exploration of methods for extending Comirit to allow safe action selection in uncertain situations, and to allow reward-maximizing selection of behaviors.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found